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ABSTRACT 


0|)i.imizal,i()n is an ini|)oi t,ant issiit' in a design puxess. 'riieio fU(' many l,iciclil,it>iia,l 
iiu'iliods for o|)iiniizaiion, hut, they have sonic inherent drawbacks. Genetic algo- 
rithms arc relatively new search and optimization techniques which are hotter at 
handling some of these difficulties. Usually, a genetic string is defined through bi- 
narization of the design variables called binary coding. The success of binary-coded 
genetic algorithms (GAs) in problems having discrete search space largely depends on 
the coding of the variables and on the crossover operator that propagates building- 
blocks from parent strings to children strings. In solving optimization problcnis hav- 
ing continuous search space, binary-coded GAs discretize the search space by using a 
coding of the problem variables in binary strings. However, the coding of leal-valued 
vatiables in finite-length strings causes a number of difficulties — inability to achieve 
arbitraiy pu'clsioii in tlu' obtained solution, inlu'rent Hamming cliff ptobhun a.sso- 
cia.ted with the binary c.odiug, a.ud puxessing of Holland’s scluMnata, in continuous 
search space;. Although, a number of real-coth'd GAs have; been devedoped to solve 
optimization problems having a continuous search space, tlie seaicli poweis of tlu'se 
crossovei ofierators are not adequate. The seaich power of a crossover operator can be 
defined in terms of the iirobability of creating an ai bitrary child solution fiom a given 
pair of patent solutions. Motivated by the succcs.s of binaiy-coded GAs in discrete 
search space problems, in this thesis, we develop a real-coded crossover (we called it 
simulated binary crossover or SBX) operator whose search power is similar to that of 
the single-point crossover used in binary-coded GAs. Simulation results on a number 
of real-valued test problems of varying difficulty and dimensionality suggest that the 
real-cod('d GAs with the SBX operator arc able to perform as good or better than 
binary-coded GAs with the single-point crossover. SBX is found to be pnrticulai ly 
iiitefni ill piuhleniM lutviii;', nnill.iple eptiimd .'inhil.iiinM wil.li n luniew |dnb;d luiiiiii juid 
in problem, s where the lower a, ml up|)er Ixninds of tliegloba.1 ojrtimum are not known 
a priori. Further, the real-coded GAs with SBX are able to successfidly find the global 
o])timum solution in a blocked function, whicb causes problems to other real-coded 
GAs. 

In the second part of this work, wc demonstrate the use of GAs with SBX on a design 


application : ambiguous shape modelling. Present CAD systems use quantitative 
models for representing the object geometries, which are not suitable for applications 
such as conceptual design, shape abstraction, qualitative pattern recognition etc., 
where one needs to represent not one, but a class of geometries. One novel mechanism 
for ambiguous shape modelling is based on qualitative mapping on the parameters of 
the axis of the shape. However, this approach presents some diflBculties in extracting 
reconstructable shapes in a generalized domain, because of the problems such as 
simultaneous multiple constraint satisfaction, and generation of inconsistent shapes. 
This can be improved by using a hybrid qualitative model with variable zones, but 
generating “desirable” shapes remains a problem. In this thesis, a GA based approach 
is used to handle these difficulties. SBX has been applied on the parameters of a 
hybrid qualitative system. The use of GAs makes it possible to proffide a visual 
display to the user (inexact visualization) and to arrive at a number of different shapes 
satisfying some constraints to different degrees (diversity). Moreover, the fitness 
criterion for the GA can be defined such that infeasible shapes get eliminated and 
more desirable shapes get preference. Some test shapes generated by this system are 
presented in the thesis, which demonstrate the viability of the system. 
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Chapter 1 

Introduction 


A (locisioii |)ioco,ss often involves sotting of some derision variables in order to arliieve {orta.in 
objertives within availabb* K'sontres a.nd ronstraiiits. h'or example, a. ik'w product may Ix' 
needed to bo designed in such a way so as to minimize its production cost and maximize its 
marketability; a manufacturing process must be planned so as to minimize the total process 
time; a satellite should be launched into a path which minimizes total energy consumption. 
All these arc examples of optimization, which deals with finding a set of decision variables such 
that an objective function is maximized (or minimized). 

Optimization is a well studied area and various algorithms for optimization are availal)ie 
(Hao, I9!)l). (IcMU'tir Algoritlims ((lAs) aie one' of the' recent terlinicpic's used lor optimization 
and are gaining popularity both in academies and in indiistiy because of certain unique features 
which a.ie discussed in tiie next chapter. 

1'yi)ically, optimization is accompanied by constraint satisfaction v/Wich relates to sea, idling 
a, solution satisfying some prc'definc'd ronslraints. 'J'liis is anotliei area where extensive researdi 
has hc’en done. Traditionally, optimization and constraint Ra.tisfa.ctioii are consideiod to lie two 
iiulejicndent fields and algorithms for them have been developed separately. However, standard 
optimization techniques are slow in constraint satisfaction and standard constraint satisfaction 
toduiiques arc inclficient in optimization. This is one of the serious problems in design process. 
It is inteiestiiig to note that CAs can be usc'd foi ‘constraint satisfaction’ auicl ‘c)]>timizatioir 
both, lienee, in this tiiesis, we focus on GAs as a seareli and optimization tool. I’o motivate 
tliis cliscuHsion, we provide an example. 

IjC't ns consider a specific task, lliat of dc'sigiiing a knife cross-section. We bc'gin with a 
class of possible shapes for cross section (Fig. 1.1). Note the progressive refinement done by 
dosigiiei on the basic sliape. In the first case, he finds the cutting edge is very thin, leading 
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to high wear rates. So he refines the cutting edge in the next design. There may be other 
constraints that need to be '‘satisfied’’, e.g. aspect ratio of the shape, which might be needed 
to fall within some range. When the designer has roughly decided the shape, he may wish to 
‘optimize’ the shape attributes, e.g. cutting angle and bevel thickness. This example depicts 
that a design process normally involves optimization as well as constraint satisfaction. A. GA 
based approach can be used to handle both of these aspects. 



Figure 1.1: Designing the cross-section of a knife. Note the basic similarity in the shapes. 

We begin the thesis in chapter 2 by an introduction to GAs. That chapter also addresses 
the problems associated with the present approach of GA (specifically, the binary GA) when 
applied to continuous search domain. To combat these problems, some researchers advocate 
use of real-coded GAs (Davis, 1991a; Wright. 1991; Radclilfe. 1993: Eshelman and schaffer, 
1993). A real-coded GA is different than standard GA in the sense that it uses floating-point 
numbers to represent the variables in contrast to binary strings used by standard GA. Since 
crossover operator is the main search operator in working of GAs. the main emphasis given 
by different researchers is on this operator only. A number of real-coded GA crossovers have 
been reported in the literature (Wright, 1991: Eshelman and Schaffer. 1993), but the search 
powers of these operators are not found to be adequate in a wide variety of problems. In 
this thesis, we have devised a new crossover operator for real-coded GA termed as simulated 
binary crossover (SBX), which has much more flexibility in terms of search power. The most 
striking feature of this crossover is that it simulates the effect of standard binary crossover in 
a real-coded variables’ domain. 
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Chapter 3 presents the results of GAs with SBX operator on a number of test problems. 
Most of these test problems are chosen from the literature, and cover a variety of functions 
— from one-variable to multivariable, unimodal to multimodal, continuous to discrete, and 
smooth to noisy. The obtained results are encouraging and show that SBX has performed 
either better than or similar to binary crossover operator. 

Gaming the confidence in the performance of SBX on several test problems, we then apply 
SBX to ambiguous shape modelling in chapter 4. Even though this has been taken only as an 
apphcation problem for real-coded GA. it is important in its own. Note that even though the 
shapes in fig. 1.1 are different from each other, there is some basic similarity in all the shapes 
(length is significantly more than the width, and width reduces from one end to otherl. This 
type of shape abstraction is what a designer will hke to keep in his model. Almost all the present 
CAD models are quantitative in nature, storing exact information about the shape. However, 
such models are of little use at conceptual stage of design, when the designer hasn't fixed up 
the shape, but still wants to try various options. For this, a qualitative model of the shape is 
reqmred, which should be able to store not just a single shape, but a class of shapes. The model 
developed here uses a novel mechanism for representing the class of shapes which is based on 
‘smudging’ the axis of a shape (also known as medial axis transform or MAT) proposed by 
King and Mukerjee (1990). One problem with tliis approach is that this mechanism sometimes 
generate inconsistent shapes. In this thesis, we use the same mechanism, coupled with G-A.S 
to eliminate inconsistent shapes and to find shapes with more desirable attributes. For this 
purpose, various penalty functions have been defined relating to properties of shapes generated. 
GA with SBX is then applied to eliminate inconsistent shapes and generate desirable shapes. 
Use of GA makes it possible to search non-linearly in a highly complex domain of search space, 
for the solutions which will be otherwise almost impossible to obtain by traditional methods. 
The test results obtained present a confidence about viability of the system for ambiguous 
shape modelling 

discussion on various possible avenues of further research on the current topic is outlined 
in chapter 5. The final chapter discusses a summary of the work performed in this thesis. 



Chapter 2 


Genetic Algorithms : Development 
of SBX 


Optimization is an important issue in a decision process. In its most general sense, ‘opti- 
mization’ means maximizing (or minimizing) an objective by setting appropriate values for 
its decision variables. Most of the standard optimization algorithms start with a imtal guess 
point, and then the solution gets updated iteratively depending upon some transition rules 
(Rao.i991). These algorithms have been extensively used in real-world applications. How- 
ever, there are mainly two drawbacks associated with the traditional optimization techniques. 
Firstly, they are not robust, in the sense that a particular algorithm suits only for a limited 
class of optimization problems. Thus, the user needs to know a number of algorithms and their 
respective domain of applicability. It does not harm to know about a number of optimization 
algorithms, but it is definitely better to know one robust algorithm applicable to many prob- 
lems. Secondly, most of the traditional optimization techniques are local-search algorithms. 
Consider, for example, the function shown in fig. 2.1. If the starting point is chosen from 
the range the algorithm is likely to converge at point L. which is a local optimum. The 
global optimum G can only be achieved if we start with a point in the range BC. In real- 
world situations, the objective functions are much more complex both in dimensionality and 
behaviour, and it may be almost impossible to select a suitable starting point to reach to the 
global solution. In the next section, we look at a very different type of optimization technique 
— genetic algorithm — which is somewhat more efficient in handling the above difficulties. 

2.1 Genetic algorithms 

Genetic algorithms ( G As ) are nontraditional search and optimization techniques based on an 
analogy with natural genetics (Goldberg, 19H91. They have been applied successfully into a 
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Figure 2.1: A simple bimodal function. Points L and G denote local and global optima, 
respectively. 

wide variety of problems, and have yielded results which were either not possible or were very 
diflFicult to achieve using traditional methods. There are some salient features of GAs which 
put them into an altogether different class from traditional methods. These are as follows ■ 

1. GAs have global perspective, i.e.. they don’t easily get stuck in local optima. 

2. GAs are robust techniques, i.e, they can be applied to many classes of optimization 
problems. 

3. GAs use a population of solution points, and not a single point. 

4. GAs use probabilistic rules for refinement of solution points and not the deterministic 
rules. 

5. GAs don’t require gradient or any other auxiliary information about the problem. 

6. GAs usually work with a coding of variables and not the variables themselves^. 

A full description of working of GAs can be found in ( Goldberg, 1989). However, we present 
here a brief overview of it’s working mechanism for the sake of continuity, 

2.1.1 Working of simple genetic algorithm 

To explain the working of GAs, we first consider a one-^•ariable unconstrained function : 

Maximize f{x) for Xmm < ^ < ^max 

'However, this feature of GA is not absolutely necessary In a later section and in most part of this thesis 
we show how the coding can be avoided. 
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All oxUiiiHioii [()! iiiulli-vatiahlc pioblcms will be discussed later. In order to solve tlic above 
problem using CAs, the variable x is typically coded in .some string structures. Binary coding 
is normally used aiul the decoded value of string is linearly mapped on the vaiia.b!e langt' 
The length of the string is usually determined according to the desired precision level of the 
solution. Thus the precision level acquired by using f-bit long string will be {xmnx - 
With a known coding, any string can be decoded to an x value, which can then be used to find 
the objective function value. A string’s objective function value (/(rj) is termed as string’s 
fitness in GA terminology. 

initializeO ; 

evaluationO ; 

while (termination criteria are unsatisfied) 

f 

reproductionO ; 
crossoverC ) ; 
mutation () ; 
evaluationO ; 

> 

l''igiiie 2.2: A p.seiido code for simple genetic algorithm. 

The working of GAs can be bi’oadly illustrated by fig. 2.2. First, a population of randomly- 
generated strings is cieatr'd. 'I'he number of strings in a population is tailed pnpnlntion size 
and usually supplied by the user. For more cornirlex problems, larger population sizes arc 
iioiiiially list'd. Once tht' ptipiilatitrn is initialized, eath stiing in the population is evaluated 
and assignetl it’s fitness value, which is a measure of it’s goodness For a, maximization 
problem, usually the olrjeclive function value itself is treated as string’s fitness value. After 
this, the three main operators — reproduction, crossover and mutation — are applied on the 
population and it is again evaluated. This is then tested for tennination criteria. If they are 
not met, the population is modified again using the same operators and then evaluated. This 
process is continued till termination criteria are satisfied. We now take a closer look at the 
three o])erators ; 

• Reproduction : This operator selects better strings from the population and sends copies 
of these strings in a temporary population called mating pool. This is analogous to 
Darwin’s survival of the fittest principle. Usually a probabilistic approach is used to 
select the strings, which results in selection of bad strings once in a while (although 



12 


wiUi a small i)robability). At first sight, it sounds awkward, but a careful thinking will 
ronvinre that this is necessary if wo want GA not to got stuck in local optima. There arc 
a number of reproduction schemes which can be used for this purpose, e.g. proportionate 
selection, tournament selection, and stochastic remainder selection (Goldberg and Deb, 
1991). 

• Crossover : The reproduction operator can’t generate new solution points. This job is 
taken caie by crossover. Crossover opeiator is analogous to recombination of genes of 
parents in biological systems. According to Deb (1993) : 

. . . the purpose of the crossover operator is two-fold. The main purpose of the 
crossovei operator is to .scaich the parameter space. Other aspect is that the 
search needs to be performed in a. way that the information stored in the parent 
stiiiigs ai(' ma.ximaily pie.served, because those parent strings are inslance.s of 
good strings selected using the reproduction operator. 

In one of the vaiioiis po.ssible rro.s,sovers (Spears and De .long, 1991), single point 
crossover, a random < ros.s-site is clio.seii along the length of string, and the bits on either 
sides of cross-site are swapped between the parent strings. In section 2.2, we will lie 
dealing with this cro.ssover in detail. 

• Mutation : It is considered to be a secondary operator in GAs, and its main function 
is to guard the population against premature convergence. Mutation Hips a randomly 
chosen bit ol a, string liom i to 0 (or vicc-veisa). Usually, mutation is applied with a 
very small mutation probability (p„i), .so that only very few strings undergo this flipping. 
To iindeiutaiid the necessity of miital,ion, coiisidei a, particular casi' in which the last bit 
of the string at global solution is 1. Now, snjipose that all the strings in a geneiatioii 
have a 0 in that place, then neither reproduction nor crossover can generate the global 
solution. Hut introduction of mutation makes it possible (though not guaranteed) to Hip 
the last bit from 0 to 1. 

For a multivariable problem, all strings corresponding to each variable are concatenated to 
form a bigger string and then this string is used for GA search. It looks surprising, but it is 
true that the three, random and simple-looking operators described above constitute a highly 
nonlinear and robust search and optimization paradigm. In the next section, wc look at some 
intutivc reasoning behind the working of GAs. 
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2.1.2 Similarity templates (schemata) 

TIu' main kchioI, of.succc'ss o( Uinary CiA i.s (,lip [)ro( ossing of .sr/irmn/o ( plural n{ firhcuxi) wiii(li 
goes on in parallel with the operations of GAs. A schema is a similarity template (Holland, 
1975) and is represented by combination of three symbols — 1,0 and * (don’t care). In general 
a scltctna rcpiescnts a class of strings. For instance, a schema (1 + + + 0) means all those 5-bit 
strings which have a 1 in the first place and a 0 in final j)laco. In the variable space, a schcmia 
represents a region which may be or may not be continuous. For example, schema (1 t + 4= 
+) represents the right half of the search space, whereas schema (+ + + + 1) represents every 
alternate string. It has been shown elsewhere (Goldberg, 1989) that those schemata, which are 
good, will be getting exponentially higher nnmbor of copies in forthcoming genera, tions and 
those which are poor will slowly die out. 'I'liis phenomenon, of schemata being processed in 
parallel is known as implicil parallelism in GAs and is one of the important underlying theories 
of GAs. 

2.1.3 Shortcomings of binary GA 

Hinary GA has been popultirly used to solve both types of problems having discrete or contin- 
uous variables. The success of binary-coded GAs in problems having discrete .sc'arch space is 
la,rgoly duo to the careful choice of a coding and crossover operator that process useful schemata 
efficiently and propogate it in successive generations. However there are certain shortcomings 
associated with tlie biiiaiy coding, especially when apiilied to continuous domain. 'These ca,n 
be summarized as below ; 

1. Artificial hindcrance to search created by Hamming cliff 

Hamming eJiff (Davis, 1991b) refers to sudden changes in genic space with a gradual 
change in variable space. For instance, even though the two strings 011111 and 100000 
are separated by only one unit in variable space, transition from one string to other in 
genic space requires change in all the bits. Thus if the global solution is at lOOOOO and 
the poi)ulation converges to 011111, it will be very difficult to reach the solution even by 
using mutation. Some reseai’cheis advocate u.se of gray coding (Scha,frer et.a.l., 1989) to 
avoid this problem, but that is merely a substitution of one problem for another, becau.so 
use of gray coding introduces artificial non-linearity in the problem. 
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2. Fixed precision level 

The precision acquired in the solution is fixed in binary GA because it depends on the 
string length specified which should be decided before-hand. The more is the required 
precision, more should be string length. For larger string lengths, the population size 
requirement is large (Goldberg, Deb and Clark. 1992), thereby increasing the compu- 
tational overheads. Even with a reasonable string length, there is always the danger 
that one has not allowed enough precision to represent parameter values that produce 
the best solution values. Further, a careful thinking reveals that binary GA gives equal 
importance to all the regions in search space as far as precision is concerned, whereas 
in an ideal algorithm, the precision level should be dependent on fitness of regions, i.e. 
more precision for better regions, and less precision for bad ones. 

3. Fixed variable bounds 

In a binary-GA, the bounds of every design variables are fixed before-hand, to facilitate 
mapping from binary coding to variable-space. In certain real-world applications, the 
user may not be knowing these bounds. If a narrow bound is specified, there is always 
a danger that the global optimum is not enclosed in the specified bounds. On the other 
hand, a too-wide range of variables require a larger string length (if precision level is 
to be kept the same) which increases the risk of premature convergence. In an ideal 
algorithm, the search should be extended to practically anywhere for an unconstrained 
optimization problem. 

4. Processing of unimportant schemata 

There are a large numbers of Holland’s schemata that are processed during a GA search. 
If we think about this processing a bit closely, we see that not all schemata that are 
processed are important. In a continuous search space, the meaningful schemata are those 
which represent continuous regions of the search space. For example, the schema (11 + 
+ +) represents the rightmost quarter of the search space, so it may he meaningful. But 
the schema (+ * + + 1) represents every alternate point in search space. This may not 
be of any importance in continuous domains. Hence, the crossover needs to be redesigned 
in order to process more meaningful schemata relevant to continuous search space. 


Realizing the above aspects, a growing number of researchers have started paying attention 
to real-coded GAs which use parameter values themselves instead of any coding. Obviously, 



this type of GA requires crossover and mutation operators which are radically different from 
their coiiiil.erparfs in binary GA. In .section 2.3 we give a brief review of existing cros.sovers for 
real-coded GAs a,n(l section 2.4 deals with a new t.ype of real-coded crossovt'i, SHX, fh'velojx'd 
in this thesis. The real-coded GA using SBX is free from the shortcomings of binary GA 
discusH('(l above. At the same time, SBX simulates the beneficial aspects of binary crossover. 
But belore we discuss that, we need to look at search power of binary crossover, which is 
discussed in next section. 

2.2 Binary crossover operator 

In many applications of binary-coded GAs, a single-point eros.sover is used. In a single-point 
cr'ORSover, a random cross site along the length of the string is chosen and the bits on one 
side of the cross site are swapped between two parent strings (Fig 2.3). In a .single-variable 
optimization problem, the action of the crossover is to create two new children strings from 
two parent strings. In a multi-variable optimization problem, each variable is usually coded in 
a certain number of bits (collectively called a substring) and these substrings are combined to 
form a bigger string. Since, in a single-point crossover operator, only one cross site is chosen 
from the complete string, the action of the single-point crossover may disrupt one variable 
or a combination of continuous variables simultaneously. In the following, we calculate the 
probability of creating a child point from two chosen points under the single-point crossover. 

2.2.1 Search power of single-poiut crossover 

In this thesis, wo a.sRumo tliat tlio .soaicli power of a cro.s.sover operator is a measure of how 
flexible the operator is to create an arbitrary point in the search spa,c('. Hero, wo calculate the 
search power of a crossover operator by finding tlie probability of creating an arbitrary point in 
the search .space from two given parent points^. To make the analysis simpler, we also assume 
that tlio function is a single- variable function. 

When two parent binary strings of length £ are crossed at an arbitrary site, the contents 
on either side of the cross site gel swapped between the parents. Let us imagine that the cross 
site is k bits from the right of the string. We also assume that the leftmost bit is the highest 
significant bit in the underlying binary coding. If the decoded value of the rightmost k bits in 
the first parent is Ai and the decoded value of the rest (£ - k) bits is Bi, then the decoded 


®The analysis in this section also appears in our earlier paper (Deb and Agrawal, lUO-l). 
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Figure 2.3: The action of single-point crossover on two random strings is shown. DV stands 
for the decoded parameter value. Notice that the average of the decoded parameter values is 
the same before and after the crossover operation. 

value of the string is as follows: 

x, = «,2^- + /l,. (2.1) 

The decoded value of the second parent string can also be calculated by knowing A 2 and B 2 
as the decoded values of the right k bits and the rest (£ — k) bits of the string, respectively: 
X2 = B22^ + A2. Figure 2.3 shows the single-point crossover operation on two arbitrary strings 
and the resultant children strings obtained by crossing at a particular site. The effect of the 
single-point crossover operator is to swap the values Ai and A 2 between the two parent points. 
Thus, the decoded values of the children strings are as follows: 

j/i = B,2’‘ + A2, (2.2) 

y2 = B22'^ + A^. (2.3) 

The above equations bring out an important property of the single-point crossover. Notice 
that the mean of the decoded values of the parent strings ((xi + X 2 )/ 2 ) is the same as that of 
the children strings (( 1/1 -|- 2/2)/2). This can also be observed in Figure 2.3, by calculating the 
decoded parameter values of both |)areiit and children strings. It is interesting to note that 
thi.s property of decoded parameter values holds good for real patameter values of parent and 
children points for a linear mapping rule. Assuming a linear mapping rule, the parent points 
can be obtained as follows: 

f=l,2, (2.4) 

where m and t are two constants. The children points can also be computed using the same 
mapping rule as follows: 

Ci = niyi + t, ?= 1 , 2 . ( 2 . 6 ) 

Using the above two equations and the previous observation on i/,-, it is trivial to show that 
mean of the parent points is the same as that of the children points. In other words, the children 
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points arc equidistant from the mid point of the parent points. We shall use this property to 
develop the leal-coded crossover operator later. Further, we observe that the children points 
may lie inside or outside the region bounded by the j)aront strings depending on tlio tlie strings 
and the location of the cross site. In order to deline the spread of the children points with 
respect to that of the parent strings, we define a spread factor /? as the ratio of the spread of 
children points to that of the parent points: 


fS = 


Cl - C2 
Pi -P2 


( 2 .( 1 ) 


With the above definition of the spread factor, we classify crossovers in three different classes: 


Definition 1 All crossover operations having a spread factor j3 < 1 are contracting crossovers. 


When the children points are enclosed by the parent points, the spread of the children points 
(the absolute difference in c\ and cj values) is smaller than that in parent jioinls. Thus, the 
spread factor /3 is less than one. Since the children points usually replace the parent points 
in nonoverlapping genetic algorithms, this crossover seems to have a contracting effect on the 
parent points. 


Definition 2 All civssover operations having a spread factor (3 > 1 arc expanding crossovers. 

When the children points enclose the parent points, the absolute difference in children points 
is more than that of the parent points. This crossover has an effect of expanding the parent 
points to form the children points. 


Definition 3 All crossover operations having a spread factor (3 = 1 are stationary crossovers. 

In stationary crossovers, the children points are the same as the paient points. One interesting 
feature of the spread factor (3 is that it hides the effect of the true spread of the children points. 
The spread factor (3 signifies a spread of children points relative to that of the parent points. 
For a fixed value of /3, two close children points can be obtained if the parent points are close 
to each other or two distant children points could be obtained if the parent points are far away 
from eacli other. In a single-point crossover, the occurrence of a contracting, an expanding, or 
a stationary crossover depends on the parent strings and the location of the cross site. 

With the above definitions of different types of crossovers, let us now compute the probabil- 
ity of occurrence of three different types of crossovers for any two random parent strings. We 
first consider that two f-bit parent strings are crossed at a site k e (0,f- 1) from the rightmost 
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bit. Once again, we assume that the leftmost bit is the most significant bit. Representing two 
parent strings as ai,6, e {0, 1}, we write the decoded values as follows: 


II 

(2.7) 

t=0 


r-1 


1=0 

(2.8) 


Using the proiierty of single- point crossover operator given in equations 2.3 and 2.3, wo obtain 
the decoded values of the children strings as follows: 


k~] t-l 

Px = + 

t^=0 i—k 

k~\ £-1 

(2.9) 

2/2 = + 

(2.10) 


1=0 i=k 


I'or a linear mapping rule (equations 2.4 and 2.5), the spread factor defined in equation 2.6 
can be written in terms of decoded values of parent and children strings. Thus, equations 2.8 
through 2.10 can be used to write the spread factor fi. Hut we simplify the expiession foi tlu' 
spread factor by assuming a parameter u, = (oi - d,) for all f = 0, 1, . . . , (f - 1). Note that 
each Ui can take a value —1,0, and 1. 


m 


Pi - 1/2 
*1 - 312 ’ 

i - C(fc) 
i + C(fc)’ 


where ((fc) = u,-2‘/ u,2’. The above term for (3 is valid for any two parent strings 

(or in other words for any u; values) and for any crossover site (or for any k). But, let us 
first consider the contracting crossovers (where 0 < /? < 1). In order to simplify our analysis 
further, we assume that parent strings arc two extreme strings having all ones (111. ..1) and 
all zeros (000. . .0). In this case, the difference in allele values in each bit is one (or «,• = 1 
for i = 0, 1, ...,(£ - 1)). Substituting u; = 1 in the above expression, we obtain the following 
expression for /?: 

I5{k) = 1 - 2(2'“’ - l)/(2^ - 1) « 1 - (2.11) 


The above spread factor for two extreme strings of length 20 crossed at different sites is plotted 
in figure 2.4. It is clear from the figure that for most cross sites k, the spread factor (3 Ki 1. 
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k 


Figure 2.4: The distribution of 0(k) versus fc for ^ = 20 is shown. 


With a distribution of the spread factor as a function of the cross site, we can now compute 
the probability of creating any arbitrary string (a representative J) from two given parent 
strings. Let us consider figure 2.4 again. For a particular value of 0. we consider a small range 
d(5 and find how many crossover operations {dk) would produce children points in that range. 
This can be achieved by calculating the slope of the above expression at the corresponding J: 


dk = 


1 




dk 

0 


d(3. 


(2.12) 


Denoting the probability of occurrence of /? by C(/3) and knowing that 0 lies between zero and 
one for contracting crossovers, w'e obtain the distribution for d0 — »■ 0: 


m = 


1/ 

M! 

dk 

P 

/o' (l/ 

dl 

dk 

Pj 

\d0 


(2.13) 


The denominator of the right-side expression is a constant quantity. Here, we are not interested 
in computing the above distribution exactly, rather we are interested in finding the functional 
relationship of the distribution with 0. Thus, it will suffice here to compute the numerator 

of the above expression only. By differentiating equation 2.11 with respect to k and substituting 
the expression for k in terms of /3, we obtain the numerator of the right-side expression in 
equation 2.13 as follows: 

P.ui 

where «: is a constant term. The functional form of the above distribution is shown in figure 2.5 
(for 0 < d < 1). The figure shows that as d is increased towards one, the probability increases. 
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Thus, the probability of creating children points closer to the parent points is larger than that 
of points far away from the parents. 

The distribution for expanding crossovers (/3 > 1) can be obtained by considering two 
other parent strings^ and by using the above procedure. However, this distribution can also be 
obtained from that of the contracting crossovers by realizing a simple probabilistic event. If two 
children strings found in a contracting crossover are crossed at the same site, the same parent 
strings are obtained. Thus, for each contracting crossover with a spread factor /3, there exists an 
expanding crossover with a spread factor 1//3. In other words, we can argue that the probability 
of occurrence of contracting crossovers having a spread factor between J and (/3 + d/3) would be 
the same as the probability of occurrence of expanding crossovers with a spread factor between 
l/(/3 + dP) and l/p. Summing these probabilities for aU contracting crossovers in the range 
0 < /3 < 1, we conclude that the overall probability of contracting crossovers is the same as 
that of the expanding crossovers'*. Equating the probability of expanding crossovers {£{P)) 
having spread factor between 3 and (/3 + d0) and that of contracting crossovers (C(p)) having 
spread factor between l/(/3 + d/3) and 1/0, we obtain the following: 


£(0)[(0 + d0)-0] = C(j) 


0 0 d0 


Rearranging terms and in the limit d0 0, we obtain 


(2.15) 


m = 


1 

0 ^ 



(2.16) 


With this relationship between the probability of occurrence of contracting and expanding 
crossovers, we only have to know the probability distribution for only one type of crossovers. 
Since their sum is the overall probability of all crossovers, the overall probability for either 
contracting or expanding crossovers must be equal to 0.5. For the probability distribution 
given in equation 2.14, we obtain a probability distribution for expanding crossover using the 
above equation: 


£{ 0 ) = 


K 


(2.17) 


0{0-iy 

This distribution is plotted in the region 0 > 1 m figure 2.5. The figure shows that the 
probability of creating children strings having a large 3 is small. 


® Notice that with two extreme strings considered in the contracting crossovers the expanding crossover is 
not possible. 

^Of course, the stationary crossovers can be considered as either contracting or expanding. For the sake of 
argument, we can assume that half of the stationary crossovers are contracting and rest half of the stationary 
crossovers are expanding. 
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P 

Figure 2.5: Probability distributions of contracting and expanding crossovers on two extreme 
binary strings are shown. 




Figure 2.6: Probability distributions of contracting and expanding crossovers on all pairs of 
binary strings of length 15 aie shown. 

llccaJl that o(|uatioiis 2.H and 2.17 arc derived only for two extreme binary strings. In order 
to investigate the above distributions and probability distributions for other parent pairs, we 
create all [lairs of binary string, s of a particular length and cross them at all possible cross sites. 
For each crossover, the corresponding spread factor /3 is calculated and arranged in ascending 
order. The experiment is continued for string lengths of 10, 15, and 20. Figure 2.6 shows the 
distribution for t = 15. Distributions obtained for other string lengths also look similar. The 
fig. 2.6 shows that the probability of occurrence of fa 1 is more likely than any other ft value. 
If the parents are closer, the spread of the two likely children is also smaller. On the other 
hand, if the spread of parents is more, the spread of likely children is also large. This aspect 
of the crossover operator distinguishes it from the mutation operator. In a mutation operator, 
the |)robal)ility of a mutated .string clo.se to the paiont string is higher, but Ibis probability is 
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usu 3 -lly constant and depends only on one string, 

I ho next section deals with the existing real-coded crossover operators. 

2.3 Existing real-coded crossover operators 

iilvoii UloiiRii l,lio shortcomings of l)iiiary CAs can i)o romovod by roal-codod GAs, llic iiilorcst 
it) lliG study of roal-codod GAs has l)Gguii vory rocoutly, and tliore still romaiii soino objt’ctioiis 
to real-coded GAs which must be resolved. Most of the theoretical objections to real-coded 
GAs assume that the crossover operator operates at parameter boundaries. But many imple- 
mentors of real-coded GAs use crossover operators with extensive search power. The real-coded 
crossovers present in literature are as below : 


1. Davis s rrossovrv. Davis (199 1 a) has devised a crossover that averages some of the 
variables. Thus if two parent points are pi and p 2 , the child point is taken as 0. 6 ( 7)1 + 7 ) 2 )- 

2. Wright’s linear crossover. Wright (1991) suggested using a linear crossover operator 
which generates three child points 0 . 6 ( 7 )! + 7 ) 2 ), (1.5pi~ 0 . 6 ^ 2 ), and (- 0 . 5 pi-H. 5 p 2 ). This 
study also suggested a crossover operator that creates a point by randomly perturbing a 
single variable in its range.® 

3. Radcliffe’s flat crossover. This crossover (RadclifTc, 1993) chooses values for a child 
point by uniformly randomly picking a value from the range pi to P 2 . Thus it uses a 
probabilistic approa,ch to create a child point, which is desirable in GA. 

4. Eshebnan’s blend crossover (BLX-a). Eshelman and Schafl'er (1993) have suggested a 
blond crossover (BLX-a) operator. For two parent points pi and P 2 (assuming pi < P 2 ), 
the l)l;X -a laiidomly |)irks a point in tlie range (pi - n{p 2 - 7 ) 1 ), 7)2 + a{p 2 - Pi)). If 
a is zero, lliis crossover becomes same as Radcliffe’s flat crossover. In a number of tost 
problems, they have reported that nLX-0..6 (with a = 0.5) porforin.s better than the 
BLX operators with any other a value. 

2.3.1 Search power of real-coded crossovers 

For the above real-coded crossovers, we investigate the search power, as defined in the previous 
section. The search powers of Davis’s crossover or Wright’s linear ciossovcr arc clearly not 
adequate, because only one and three children points can be obtained from two parent strings, 


^Ilowcvor, this is more; like a mutation operator, rather than a crossover. 
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respectively. RadclifFe’s flat crossover or Eshelman and Schaffer’s BLX-0.0 operator has the 
fl('xil)ility of croiititif!; any iioint (Miclosed i)y tlie parent i)oiiits, lliiis tiaviiif^ a. better search 
power. Eshelman and Schaffer (1993) have also introduced concept of interval sciioinata, and 
claimed that BLX crossover preserves the interval schemata representing continuous regions in 
the search space. 

Interval schemata are defined by all possible subranges in the variable space. Considering a 
variable whose range is (0,k), there will be a total of 0.5(A:+l)(^;+2) interval schemata according 
to the formulations of Eshelman and Schaffer. It is assumed that the points represented by 
an interval schemata are bounded by integers only. We consider any two parent points in the 
variable range, and then calculate the number of common schemata between them and an 
arbitrary children point. When llic.so arc plotted in terms of spread factor /3, we note that 
tlioy decrease linearly for /3 > t, as shown in the fig. 2.7. A detailed analysis on this issue 
can be foniul in our earlier paper (Deb and Agrawal, 199d). It is to be noted that IHjX-O..') 
deviates from this ideal distribution, in the sense that it gives equal probability to all /? values 
in the range (0,1.5). Hence, we see that the search power of BLX-0.5 is radically dillcrcnt than 
binary crossover. It may be argued that BLX-0.5 has higher schemata destruction rates as 
compared to binary crossover. Motivated by this observation, we devise a new crossover (we 
call it simulated binary crossover) in the next section. 





figure 2.7: Brobalrilily distributions of BLX-0.5 and the ideal distribution are shown witli 
spread factor /3. 

2.4 Simulated binary crossover (SBX) 

Simulated binary crossover developed in .the present work is a different type of real-coded 
crossover in a sense that it uses a particular probability distribution to generate children 
points. We have designed this probability distribution to be similar to that found in actual 
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Figure 2.8: Probability distributions of contracting and expanding crossovers for SBX model 
are shown. 

binary crossover. One property of a probability distribution is that the cumulative probability 
of all possible states is equal to one. But the integration of the probability distribution in 
equation 2.14 over all values of J is not a finite number, this is because at /3 = 1 the probability 
does not exist. Hence, it is not a valid probability distribution. Moreover, recall that the 
distribution obtained in equation 2.14 is only for two extreme binary strings as parents. For 
contracting crossovers, we assume a simple polynomial probability distribution which is similar 
to the distribution in equation 2.13 and to that shown in figure 2.6. 

C{3) = 0.5(n+ l)b". (2.18) 

where n is any nonnegative real number. The factor 0.5 makes the cumulative probability at 
= 1 equals to 0.5. For expanding crossovers, the probability distribution can be derived by 
using equation 2.16 and 2.18: 

^(/3) = 0.5(n+l)^. (2.19) 

A large value of n allows a better approximation of the above distribution with equation 2.14 
for large values of /3. A small value of n makes a more uniform distribution in the range 
0 < li < 1. A value of ra = 0 makes a perfectly uniform distribution in that range (Figure 2.8). 
A moderate value of n (2 to 5) matches closely with the simulation results for the single^point 
crossover in binary-coded G.A.S. The probability distribution for contracting and expanding 
crossovers for various values of n are shown in figure 2.8. 

Using the probability distribution given by equations 2.18 and 2.19. we now present the 
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spread factor — > 

Figure 2.9: Calculation of the spread factor 3 from a given random number u. 


algorithm for calculating children points with the SBX crossover operator. 

SBX Algorithm 

INPUT : pi, P 2 (Parent points) 

OUTPUT : Cl, Cl (Children points) 

• Generate a random number u such that 0 < u < 1 


• Calculate the spread factor, 


(2u)‘’’+** if u < 0.5, 

/5(u) = { \ , 

( 20 ^)'"^" otherwise. 


• Calculate children points, 


Cl = 0.5(pi +P 2 ) - 0.5^(u)|p 2 -Pil 


C 2 = 0.5(pi + P 2 ) + 0.5/3(u)|p 2 - pi| 


It is interesting to note that the number u used in the above algorithm represents the area 
under curve from J = 0 to /i = P{u) (Fig. 2.9). We could have also assumed u to represent 
the area under curve for 3 = 0{u) to 3 = 00 . In that case, equations for calculating /3(ii) can 
be suitably derived. 

The polynomial probability distribution given in equation 2.18 is one of many possible prob- 
ability distributions for contracting crossovers. For a different distribution, the corresponding 
probability distribution for expanding crossovers can be calculated by using equation 2.16. It 
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is worUi niciilioiiiiig iierc tlial, a distribuUoti lor expanding crossover can also bo assumed lirsi, 
and Uion Uic corresponding probability distribution for contracting crossovers can be com- 
puted by replacing f(/3) with C(/3) in equation 2.1(5. In all such probaliility cli.stribntions, 
the probability of creating children points close to the parent points must be more than that 
of creating children points far away from the parent points (This simulates the effect of the 
single-point crossover as found in figure 2.4). However, in the remainder of the thesis, we use 
the polynomial probability distribution given in equations 2.18 and 2.19. 

The concept of probability of crossover (pc) used in canonical GAs still applies to this real- 
coded GA. 4’his probability inoa,ns that on an average of the points in the mating pool 
are participated in the crossover. 

2.4.1 Properties of SBX 

The exponent ?i in the probability distribution equations is an important parameter. It is 
clear from liguie 2.8 that the distribution largely depends on the exponent n. For small values 
of n, points far away from the parents are likely to be chosen, whereas for large values of n, 
only points closer to the parents are likely to be chosen. In a sense, the exponent n is similar 
to the reciprocal of the temperature parameter (7’) used in simulated annealing algorithm 
(Kii kpatrick, ct.al., 198;i). Ideally, a good search algoiithin must have a, broad si'aich (with 
large T or small n) in early generations and as the generations proceed the search must be 
focussed on to a narrow region (with small T or large n) to obtain better precision in the 
solution. However, for brevity, wo use a fixed compromised n for the entire simulation. 

Let us now check the validity of equations 2.18 and 2.19 as a probability distribution curve. 
Any ideal [irobability distribution curve should be at least C° continuous and the area under 
curve should bo equal to unity. 

Area under curve for = 0 to 1, 

/li = 


Area under curve foi ft = 1 to oo. 



f 0.5(11+ l)ft”-dft 
7 / 3=0 


0.5(n-M) 

().r> 


/ 3 "+^ 

( 71 + 1 ) 
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= 0.5(71+1) 
= 0.5 


1 Y 
. (tt + ]}/5'‘+'Ji 


lloiico areas under curve for contracting and expanding crossovers are separately ccpial to 
0.5 and when taken together, the area under whole curve sums to unity. Also note that the 
ordinate of distribution curve for /3 = 1 is found to be 0.5(?i + 1) by both equations 2.18 and 
2.19 which ensures C” continuity of the curve. 

It is interesting to note that the probability distribution for SBX with n = 0 is simila.r to the 
probability distribution calculated in terms of common interval-schemata of previous section. 
For 77 = 0, all contracting crossovers have a uniform probability and the probability in the 
expanding crossovers diminishes with the increase in the spread factor. The only discrepancy 
between these two distributions is that in the SBX operator with n = 0, the probability reduces 
in a nonlinear manner and that in the ideal intnrval-schoma procc.ssing, the probability reduces 
linearly. Nevei theless, we can argue that the SBX operator with n = 0 respects interval-schema 
processing better than the BLX-0.5 operator. However, other SBX operators with nonzero ??, 
do not respect interval schema. This analysis suggests that the other SBX operators may 
respect some other schema which represents contiguous points but with varying importance. 
The formalization of this schema processing is a mathematical exercise, which would be of 
little importance to the current study. Thus, we avoid that exercise in this thesis and focus 
our attention to study of above described crossover only. 


2.4.2 Analysis of effect of 7t over expansion of population 


111 the jirevioiiH .section, w(' stated that value of exjioiieiit n basically rodects the projnmsity of 
GA search. In the following, we give a more formal discussion on this issue. Since spread factor 
ft represents a normalized parameter for the spread of children poinhs, we wish to cstablisli 
a relation between n and probability that a certain value of ft will be exceeded in a set of 
N randomly generated ft values according to SBX probability distribution. For this purpose, 
let’s assume that population size is 2A, giving rise to total N random instances of ft each 
generated according to SBX distribution. Further, let p{fto} denote the probability that a 
single random value of ft will not exceed a specilic value fto- This probability p(/io) 
obtained by iiitegraliiig the equations 2.18 and 2.19 from ft = 1) to fto, 


1 / 371+1 

2P0 

1 --i 


2 / 5 ? 


rt+T 


p{fto) = 


if fto < 1.0 
otherwise. 
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Figure 2.10: Effect of exponent n on probability of spread of (5 values. 
Now, the probability that all N instances of /3 wiU be confined to range (0,/9o), 

mm = Wo)f 


since all N events are independent to each other. Further, the probability that at least one (5 
value generated exceeds /?o, wiU be given by, 

f 1 - 

= 1 - Piv(^o) = < f j 

The equation 2.20 is used to plot the figure 2.10 which shows Ps(P) versus /3 for various 
values of n. The figure illustrates that for smaller values of rt, there are higher chances to 
generate far-off points (higher 3 values) whereas low values of n wiU most probably confine to 
smaller ranges of (3 spread. 

2.4.3 Problems involving fixed variable bounds 

Figure 2.8 shows that the search of SBX crossover extends practically to any point in the real 
space. This allows the children points to lie anywhere between negative oo to positive oo. In 
certain problems, the search space may be bounded. For example, if the length of a member in 
an engineering component design is a variable, negative values of the variable do not make sense. 
The SBX operator as described above does not always guarantee a child point with a positive 
value, even though both parents may have positive values. The contracting and expanding 
probability distributions can be modified so that the probability of creating points outside a 


AT 


if f3o < 1.0 
otherwise. 


( 2 . 20 ) 
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Figure 2.11: Modification of probability distiibution curve for rigid bounds on variables. 

given bound is zero, while maintaining the same nature of the probability distribution. The 
way we have achieved it is that for any two given parent points, the cumulative probability of 
the children points within the given bound is first calculated. If this probability is a (say), the 
probability distribution fungtion is multiplied by a factor 1/a, so that the overall probability 
for points within the bounds is equal to one. Fig. 2.11 shows the distribution curves before 
and after modification by this strategy. 

In the following, we discuss how the SBX operator can be extended to multi-variable 
optimization problems. 

2.4.4 Extension to multi- variable problems 

The SnX developed above for single- variable functions can be easily extended for multi- variable 
problems. Thcic can bo various approacliG,s to do this, of which we have implemented the 
following three approaches : 

• Approach 1 : Simulation of single-point binary crossover ; 'I'lie mn.st cninmoii way to 
code multiple variables using a binary coding is to use a mapped concatenated string 
(Goldberg, 1989). In a single-point crossover, one cross site is chosen along the entire 
string and the contents on the right-side of the cross site are swapped between the parents. 
When binary-coded GAs are used to solve a A-variable function, and when the cross site 
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falls OB. tke bits represeating the fc-th variable from the left, the siagle-poiiit crossover 
creates a aew value of the fc-th variable and swap the (fc + l)-st to iV-th variables betweea 
the parents. The same operation can be simulated by using the SBX operator on the 
fc-th variable (fc is chosen between 1 to iV at random) and swapping the {fc -1- l)-st to 
iV-th variables between the parents. 

• Approach 2 : Uniform crossover : It has been shown elsewhere (Booker, 1993) that the 
feature of the single-point crossover discussed in approach 1 causes a large positional 
bias for certain combinations of variables to get propagated from the parents to children. 
Hence in the second approach, we choose a mechanism similar to uniform crossover for 
multiple variables (Syswerda, 1989) for choosing which variables to cross. We perform 
SBX crossover in each variable with a probability 0.5. This causes about 50% of the 
variables to be crossed, on an average. Note that a different value of random spread 
factor wiU be generated in general, for each variable that is crossed. In the case of 
single-variable functions, since there is only one variable, the SBX is performed with a 
probability 1.0. 

• Approach 3: Children on a line : If we calculate a single random spread factor /3 and 
apply it on each variable, the resulting children points will be aligned with the parent 
points. Even though tliis strategy creates some bias in the search space, this sort of bias 
is sometimes useful for certain problems (see table 3.10 in chapter 3). 

2,5 Real-coded mutation operator 

In real-coded GAs, the mutation operation can be achieved by choosing a neighbouring point 
in the vicinity of the current point (Back, Holfmeister, and Schwefel, 1991). Following the same 
strategy, a real-coded mutation operator is developed on the same lines as SBX operator. This 
operator is applied on a single parent point (as in traditional GAs) and results in a perturbance 
of its parameter values. This perturbance is calculated again using a probability distribution 
which has the property that small perturbations are much more probable than large ones. In 
order to explain the working of this operator, we first define the perturbance factor 6 as the 
ratio of actual perturbance A to maximum allowed perturbance A^ax • 

A ^new ~ ^0 

o — . * . ) 

ZXmox ^max 
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where xq and Xnetu denote the parameter values before and after applying mutation, respec- 
tively. Perturbance factor S is randomly generated using the following probability distribution ; 

P(S) = 0.5(n + 1)(1 - - 1 < < 1 (2.21) 

where n is a non-negative number. This distribution is shown in fig 2.12 for n = 0, 1 and 4. 
Fig. 2.12 shows that this distribution is very similar to SBX’s contracting crossover distribution 
for range ( J = 0 to 1) except that both distributions are mirror image to each other. Here also, 
a low value of n gives chances to far-off children points whereas a high value gives much more 
probability to near-by points. After a S value has been calculated according to probability 
distribution of eq. 2.21, the new value of parameter {x^ew) can be obtained using eq. 2.5. 

Ideally the maximum allowed perturbance Amai should vary with generations, (namely a 
large value initially and smaller values for later generations), but in our implementation we 
have kept a fixed value for Amai equal to half of the search space for the particular variable 
being mutated. 



Figure 2.12; The probability distribution of perturbance factor 6. 


2.6 Binary GA vs real-coded GA with SBX 


:V2 


In this chapter, we discussed the shortcomings of binary GAs, when applied to contiiinons 
search space. A discussion on real-coded GAs was also presented, with a special emphasis on 
SBX (uossover operator developed in this thesis. A real-coded GA using SBX has antiinber of 
advantages over binary GAs or other real-coded GAs as outlined below ; 

1. No problem of Hamming cliff : Since real-coded GAs use the parameter values themselves, 
the question of Hamming cliff does not arise at all. 

2. Adaptive precision level ; Ignoring the fact that computers have limited precision level, 
we can argue that there is no theoretical limit of the precision level, upto which a real- 
coded GA can go. Unlike the fixed |)rofision obtained in binary GA, the rcal-codod GAs 
give a j)recision in the solution which is adaptive in the sense that it keeps on refining 
corresponding to status of convergence of GA. 

3. Flexible variable bounds : The polynomial distribution used in the SBX crossover lets the 
children to lie practically anywhere between negative cx> to positive oo. This feature of 
SBX distinguishes it frotn binary GA or other real-coded crossoveis which opeiate inshh' 
fixed variable bounds. In certain situations, where tliis feature is undesirable, we apply 
the simple modification discussed in section 2.4.3. 

Processing and preservence of useful schemata : It was discussed in section 2.4 that SCX 
with n = 0 processes interval schemata. SBX with other value of n may respect some 
other schemata which represent continuous regions. A probability distribution used for 
SBX lueservcH more number ofinteival schemata than BhX, in general. This can bo 
realized by following consideration. Let’s assume that two parent points pi and p 2 are 
used to generate children points Ci and C 2 . If child points are created exactly at middle 
of parent points (which corresponds to 0 =0), there is maximum destruction of interval 
schemata. SBX with a non-zero n value gives zero probability for this case, whereas BLX 
gives equal probability for any 0 value between 0 to 1. 

V 

In the next chapter, we present our simulation results on a test-bed of 12 functions, which 
gives a validation ol aj)j)licability of SBX. 



Chapter 3 

Simulation Results of SBX vs 
Binary Crossover 


This chapter presents a summary of simulation results of GA with binary crossover and SBX 
crossover. For the purpose of testing the efficacy of SBX vs binary crossover, we have chosen a 
test bed of 12 functions, of which nine are taken from the existing literature. The test functions 
range from single-variable functions to multi-variable functions, from unimodal functions to 
multimodal functions, and from deterministic functions to stochastic functions. Finally, one 
blocked function introduced .by Goldberg (1991) is included to investigate the effect of blocking 
on the performance of the SBX operator. Table 3.1 gives a summary of all the test functions. 

3.1 Simulation details 

A real-coded genetic algorithm using SBX is implemented in C programming language. Ap- 
pendix I gives a summary of capabilities of this program. The initial population of variable 
vectors is created in an initial range of variables supplied by the user. However, in subsequent 
generations, the population is not guaranteed to be confined to the given range, because the 
SBX operator has a nonzero (though small) probability of creating children points far away 
from the parent points. This is advantageous in problems where the location of the true op- 
timum is not known beforehand. We demonstrate this flexibility of real-coded GAs with the 
SBX operator in test problems by initializing the population far away from the true optimum.^ 
In all our simulations, we allow the SBX to create any point in the real space with probability 
distributions given in equations 2.18 and 2.19. In order to investigate the effect of the prob- 
ability distribution of contracting and expanding crossovers in the SBX operator, we present 

^However, this feature of the SBX operator is redundant for problems where the lower and upper limits of 
the variables are absolutely known. In those cases, the modification as discussed in section 2.4.3 can be used. 
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'I'ahlft A fiiiitiiiiiuy of Uio t(;sl, functions 


S.No. 

Function Name 

1 

Number of 
variables 

Conirncnts 

1. 

V-fuiiction 

1 

unimodal, smooth, continuous 

2. 

V-cliff function 

1 

unimodal, smooth, 
discontinuous at optima 

3. 

Bimodal, equal spread 

1 

bimodal, smooth 
equal spread of both basins 

4. 

Bimodal, unequal spread 

1 

bimodal, smooth 
global basin narrower 

5. 

Pole problem 

2 

four optima, global basin narrow 

G. 

De .long’s FI 

3 

smooth, unimodal 

7. 

De Jong’s F2 

2 

multimodal, smooth 

8. 

Do Jong’s F3 

5 

discrete, stair-case type, unimodal 

9. 

Uo Jong’s F4 

30 

iiitimotlal, stotliastic (noisy) 

10. 

De Jong’s F5 

2 

25 optima, global basin narrow 

11. 

Rastrigln’s function 

20 

10^° optima, one is global 

12. 

Blocked function 

2 

multimodal, smooth, 
global optimum blocked by local ones 


siimilalion luns with a number of different exponent values [n). It may be recalled here that 
for small values of the exponent n, the children points far away from the parent points can be 
obtained, but for larp,e ?i, only points chmer to the piuent points are likely to he generated. 'I'o 
investigate primarily the elfccl of crossover alone, we have kept mutation piobability zero. 

It is important to mention here that the performance of GA can not be menstired triiely by 
a single run. Since GAs use probabilistic approach in it’s search process, it is possible that even 
with the specilication of the same GA parameters, the performance of GA run may change 
substantially by initialization with a different population. In GA literature, the performance 
of GA is measured usually by running the algorithm several times, each time with different 
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initial population and reporting it’s reliability which is the ratio of number of successful runs 
and the number of total runs (Eslielman and Schaffer, 1993; Cordon and Whitley, 1993). W(' 
lollow the name nl.i i\l.e)i,y. I'oi' em h C A tun, Hie lei ml mil |im m nil n II aiiv ime (m mm e ) el the 
roilnwiiig cilterla in mithilled. 

1. 'I he best iiidi vidiial in the' {mpiilnlioti hi chide |,n llie (file optimal point vvllh it nmnll 
tennination margin f in all problem variables. In this case, we assuiuo that the GA has 
ruuiid the true optimal jioint. 

2. 'file function value of the best individual in the population is better than or equal to a 
target function value /j. In this case, we say that the GA has obtained the true optimal 
solution. 

3. All individuals in the population have converged to a small neighborhood (within a smalt 
margin c in all problem variables) to a point other than the global optimum point. This 
case is termed as premature^ convergence. 

A. The number of generation has exceeded a specified large number T^ax. In this case, the 
GA has not converged to any solution at alt. 


Unless specified, all simulations use the following GA parameters : 

• Maximum generations 2'max ^ 200 

% 

• 'farget function value /j- : Function value at global optimum 

• Selection scheme : Binary tournament without replacement 

• Crossover for multi-variables : Approach 2 of section 2.4.4 (Uniformly 50% variables) 

• Crossover probability Pc : 1-0 

• Mutation probability p,„ : 0.0 

The performance of real-coded GAs with thcSHX o|)e,rator is compared with Hl/X-O.f) and 
with single-point crossover. Average, number of function evaluations rexpiired to successfully 
solve a iiroblem (when either criterion I ot 2 above is sati.sfied) is tabulated for each (iroblem. 
Other GA parameters are mentioned in the description of the simulation results, given in the 
following. 

^Notc that if GA converges to a small neighborhood of global solution, then criterion 1 or 2 will be satisfied 
before thi.s. 
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3.2 V-function 

This function is a single-variable function defined in the range 0 < a; < 1 (Eshelman and 
Schaffer. 1993) and is shown in figure 3.1. The function has only one minimum point at 
X = 0.5: 

AC®) = lx - 0.5|. (3.1) 



First, the binary-coded GAs are used. With a string length of 30 and a population size of 
50. binary-coded GAs with the single-point crossover operator find a point with six decimal 
places of accuracy (e = 10"®) only 23 out of 100 times. In rest of the runs, GAs have converged 
prematurely to a point close to the true optimum, but not as close as within e from the optimum. 
The average function value of the best individual in all 100 runs is found to be 0.0015, which 
is very close to the function value of the true optimum (/^ = 0.0). It is important to note 
that the minimum point of the above V-function corresponds to a Hamming chff in the binary 
coding used, but with e = 10"® there are about 2,147 strings representing the region (0.5 — e, 
0.5 -f c). However, the presence of Hamming cliffs is one of the problems of the binary coding, 
as encountered by other researchers (Davis, 1991b). 

Real-coded GAs with the SBX operator are applied next. A summary of the simrdation 
results is tabulated in Table 3.2. A termination margin of e = 10"® is used. The results show 
that the real-coded GAs with the SBX operator find the optimum point in all simulations with 
the chosen tolerance. Notice that the performance of the SBX operator remains the same for 
different \’alues of the exponent n. 


The success rate of BLX-0.5 operator on this problem is equally well to SBX. Since the 
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Jable 3.2: Simulation results of GAs with different crossover operators on the V-function are 
tabulated. Ihc number in brackets in column six indicates the number of times GAs did not 
converge to any point at all. 


Operator 

Range 

Popsize 

n ox 1 

Performance of 100 runs 

Avg. Trials 

Success 

Prem. conv. 

l-pt 

(0,1) 

50 

30 

23 

77 

834. K 

SBX 

(0,1) 

50 

0 

100 

0 

929.5 




2 

100 

0 

00 




5 

100 

0 

818.5 



100 

0 

100 

0 

1739.0 




2 

100 

0 

1396.0 




5 

100 

0 

1321.0 

BLX-0.5 

(0,1) 

50 


100 

0 

746.0 



100 


100 

0 

1368.0 

SBX 

(().!), 1) 

50 

0 

~ioo “ 

0 

1279.5 




2 

100 

0 

2018.5 

BLX-0.5 

(0.9,1) 

50 


24 

76 

2604.2 

SBX 

(0.9999,1) 

50 

0 

100 

0 

■SEH 




2 

100 

0 


BLX-0.5 

(0.9999,1) 

50 


0 

01(39) 



SBX operator performs expanding crossovers about the half the time and searches more region 
than 13LX-0.5, the function evaluations required to solve the above problem is less in the case 
of BLX operator. But, when the same problem is solved with a population initialized in the 
range (0.9, 1.0), the performance of SBX is much better than that of the BLX-0.5 operator. 
The siicces.s rate of S BX is the same as before, but the success rate of BLX-0.5 reduces to about 
25%. Recall that the minimum point does not lie in the range where the initial population 
is cr(’at(!<l. In order to find the optimum point, the search operator has to searcli outside 
tliis range. Obviously, the binary-coded GAs will fail to find the true optimum in this case, 
because of the flxcd-mapped coding of the variable. The search power of SBX is further tested 
by starting with ])opulatious confined in the region (0.9999, 1.0000). The termination margin 
used in tliis case is also 10~®. With SBX, the expanding crossovers find better points outside 
the initial range and the whole population migrates towards the true optimum. B he success 
rate of SBX is again 100% in this case, whereas BLX-0.6 performs poorly. 
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Figure 3.2: Effect of population size and exponent n on performance of GA on V-function. 
3.2.1 Parametric study 

After gaining the confidence in working of SBX in this simple test function, wc began with 
study of effect of various parameters on the performance of GA. We selected two parameters, 
population size and tolerance (c) for this purpose. The effect can be seen in the number of 
successful runs and the average trials given for successful run, which are tabulated in table 3.3 
and table 3.4 and plotted in figs. 3.2, 3.3 and 3.4. These figures show that for small values 
ol 7j, blJX works sati.sfactoiily ahno.st independently to population size and tolerance value. 
For larger n, the reliability decreases with decrease in population size (fig. 3.2) or with smaller 
tolerances (fig. 3.3). It is not surprising, because for higher values of n, there are more chances 
of GA to converge prematurely, and there is likelyhood that the solution is not achieved with 
desired accuracy. Fig. 3.4 is a plot of tolerance vs average number of trials given for successful 
runs, for n = 1 and population size = 25. The linear nature of curve depicts that if GA can 
find a solution with some prescribed precision in some particular niiinbcr of trials, the higher 
precision in the solution can be obtained with some more function evaluations. This also gives 
a validation to our claim that real-coded GAs with SBX can goto any arbitrary precision in 
finding the solution. It is interesting to note that the simulation curve of fig. 3.4 can also be 
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Figuio 3.3: ICd’oct of loloranco and exponent v on i)eiTorniance of (f A on V-rimction. 
approximated by the following equation. 

Avg. number of trials {Navg) = a — b In e 

where a and b are some constants. This shows that the computational overheads of the 
algorithm (here expressed as average number of function evaluations) increase as logarithm of 
desired acoiracy. 
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Figure 3.4: Efrcct of tolerance on average number of evaluations required by GA for V-function. 

3.3 V-cliff function 


This function is similar to the V function, except that the function has a discontinuity at the 
minimum point (.t* = 0.5) (Eshelman and SchafTer, 1991) and is shown in lig. 3.5: 


/ 2 (*) = 


0.6 — X, if X < 0.5; 
X - 0.5, otherwise. 


(3.2) 


The tolerance parameter r = 10"® is used again. Table. 3.5 shows the performance of 
binary-coded GAs with the single-point crossover, real-coded GAs with SOX and BLX-0.5 on 
this function. 
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'I'ablc 3.3: Sitnulalion results of GAs with din'ereut population sizes oil the V-fuiiction. 



Range 

Popsizo 

11 

Performance of 100 runs 

Avg. Trials 

Success 

Prom. conv. 

10--^ 

(0,l) 

10 

0 


1 

93.7 




1 


13 

90.5 




2 

68 

32 

73.8 , 




3 

50 

50 

73.0 




5 

27 

73 

54.8 




8 

21 

79 

40.5 




10 

13 

87 

39.2 



25 

0 

100 

0 

143.0 




1 

100 

0 

132.0 




2 

100 

0 

131.0 




3 

95 

5 

130.8 




5 

83 

17 

146.4 




8 

59 

41 

133.9 




10 

48 

52 

109.4 



50 

0 

100 

0 





1 

100 

0 





2 

100 

0 

183.5 




3 

100 

0 

217.5 




5 

100 

0 

224.5 




8 

97 

3 

211.3 




10 

90 

10 

244.9 


The binary-coded GAs with single-point crossover is not able to find a solution close to the 
true optimum in 100% simulations. The performance of binary-coded GAs is worse than the 
simple V-function. The real-coded GAs with the SBX operator, on the other hand, performs 
well for small values of exponent n. The deterioration in the performance of SBX with either 
n = 0 and large n is due to the discontinuity at the optimum point. However, the performance 
of BLX-0.5 is 100% with slightly more function evaluations than that of SBX with n = 2. 
The table clearly shows that SBX performs much better than BLX-0.5 when the population is 
initialized far away from the minimum point. 


3.4 Bimodal, equal spread function 


The bimodal function used here has two minimum points one is better than the other (fig. 3.6). 
The basin of attraction of each minimum point is identical to the other: 


r _ exp(-(® - 0.25)V0.01), if X < 0.5; 
/3(a:} - I _o.5exp(-(x - 0.75)V0.01), otherwise. 


(3.3) 
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Table 3.4; Simulation results of GAs with different tolerances on the V-function. 


'folerance (c) 

Range 

Popsizc 


Performance of 100 runs 

Avg. Trials 

n 

Success 

Prem. conv. 


(0,1) 

26 

0 

100 

0 

263.5 




1 

100 

0 

231.5 




2 

99 

1 

220.7 , 




3 

93 

7 

254.0 




5 

72 

28 

231.9 

10-*^ 

(0,1) 

25 

0 

100 

0 

396.5 




1 

100 

0 

3/10.5' 




2 

99 

1 

325.8 




3 

91 

9 

365.7 




5 

59 

41 

326.7 

10-^* 

(0,1) 

25 

0 

100 

0 

536.3 




1 

100 

0 

444.0 




2 

99 

1 

434.6 




3 

89 

11 

461.2 




5 

53 

47 

438.7 

ipr 

(0,1) 

25 

0 

100 

0 

667.8 




1 

100 

0 

563.0 




2 

99 

1 

543.7 




3 

88 

12 

577.6 




5 

46 

54 

535.9 

]()-« 

(0,1) 

25 

0 

100 

0 

747.5 




1 

100 

0 

622.0 




2 

98 

2 

618.4 




3 

83 

17 

647.9 




6 

41 

69 

595.1 


This function has a local minimum at a; = 0.75 and a global minimum at a: — 0.25. llie 
function values at the local and global minima are -0.5 and -1.0 respectively. This function 
is chosen to test whether real-coded GAs with SBX can find the global minimum point or 
not. Table 3.6 compares the performance of both GAs for a termination margin of € = 10 . 

Once again, real-coded GAs with both SBX and BLX-0.5 perform 100% of the time. Despite 
the presence of the local optimum, real-coded GAs find the true optimum point with the 
desired accuracy. In binary-coded GAs, a larger population is required to achieve a desired 
solution accuracy for a problem coded in a bigger string (Goldberg, Deb, and Clark, 1992). 
However, the performance of binary-coded GAs is inferior to that of real-coded GAs with SBX 

or BLX-0.6. 

The performance of neither real-coded GAs is good for runs with adverse initial populations 
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Tabic 3.5: Simulation results of GAs with different crossover operators on the V-cliff function 
are tabulated. The number in brackets in column six indicates the number of times GAs did 
not converge to any point at all. 


Operator 

Range 

Popsize 

n or 1 

Performance of 100 runs 

Avg. Trials 

Success 

Prem. conv. 

1-pt 

(0,1) 

60 

30 

16 

84 

840.6 

SBX 

(0,1) 

50 

0 

88 

0(12) 

2,015.9 




2 

100 

0 

1,382.0 




5 

58 

42 

1,191.4 



100 

— 

96 

4 

2279.2 

BLX-0.5 

(0,1) 

60 

SB 

100 

0 

1,726.0 

SBX 

(0.9999,1) 

60 

0 

88 






1 

100 

HhH 





2 

74 



BLX-0.5 

(0.9999,1) 

50 


0 

62(38) 



(with a population initialized in the local basin). Since, the population is initialized in the range 
(0.5, 1.0), the global minimum point (s* = 0.25) is not included in the range. Thus, the binary- 
coded GAs will not be able to find this minimum point with the above initial populations. The 
performance of the real-coded GAs with SBX is marginally better than with BLX-0.5. Since 
al)ont 50% of the j)ointB arc created outside tlie range enclosed by the parents, a few points 
in the global basin are expected to be created by SBX. Since the binary tournament selection 
sclierrie and 'A po = 1 arc used in the above simulations, the selection pressure is not enough to 
maintain those points in the global basin. When the crossover probability of SBX is reduced 
to Pc = 0.5, the disruption of good points reduces and the success rate of SBX increases to 34 
out of 60 runs with n = 0 (with approximately 2,006 function evaluations). 
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Figure 3.6: Function /a (bimodal, equal spread function). 

3.5 Bimodal, unequal spread function 

This function is a modification of the previous function (fig. 3.7). Here, the basin of attraction 
of the global minimum is smaller than that of the local minimum: 

/ 4 (a:) = ^-exp(-(a: - a,)^/6,)> 

a^ = (0.2, 0.6) bi = (0.0016, 0.16) 



Figure 3.7: Function U (himodal, unequal spread function). 

Table 3.7 compares the performance of single-point crossover, SBX, and BLX-0.5 on this 
function. Compared to the previous function, the performance of the binary-coded GAs with 
single-point crossover in this function is marginally worse because the global basin is compara- 
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'I'ablo 3.(5: Sitnnlation roaiilts of GAs with difrcrcnt crossover operators on the function /a. 






Perforninncn of 50 riiiiH 


Operator 

Range 

I’opsize 

11 or f 

Success 

Prem. conv. 

Avg. IMals 

1-pt 

(0.0, 1.0) 

100 

20 

49 

1 

765.3 




25 

48 

2 

875.0 




50 

30 

20 

1023.3 

SBX 

(0.0,1.0) 

100 

0 

50 

0 

870.0 




1 

50 

0 

688.0 




2 

50 

0 

660.0 




3 

50 

0 

624.0 




5 

50 

0 

642.0 




10 

50 

0 

680.0 




15 

47 

3 

725.5 




20 

43 

7 

700.0 

BLX-0.5 

(0.0, 1.0) 

100 


50 

0 

670.0 

SBX 

(0.5, 1.0) 

100 

0 1 

4 

4(5 

4500.0 




2 

9 

41 

1400.0 




5 

1 

49 

1800.0 

BLX-0.5 

(0.5, 1.0) 

100 


0 

50 



tlvely smaller. The performance of the real-coded GAs with SBX operator remains unchanged 
except that the SI3X with a small exponent n does not work that well. The reason for its 
failure is the same as that in the previous function with adverse initial population. Although 
some points in the global basin are created, the selection pressure is not sufficient to maintain 
them in subsequent generations. 

The difference in performance of SBX and BLX-0.5 is clear in this function. Since the local 
basin occupies most of the search space, a large portion of the initial random population is 
created in the local basin. About 45 out of 50 GA simulations with BLX-0.5 get confined to 
the local basin and finally converge to the local optimal solution. Even though some points 
are created in the global basin, they are difficult to maintain in the population, because of 
the large disruption rate associated with the BLX-0.5 operator. On the other hand, with SBX 
having a moderate n, the spread of the children points created using SBX is not substantial. 
The population gradually shifts towards the global basin and finally converges to the global 
oi)tiitiiiiii point. 
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'I’ablo 3.7: Simulation results of CAs with dillerent crossover operators on the function /d are 
tabulated. The number inside brackets in column six indicates the number of times the GA 
did not converge to any point at all. 


Operator 

Range 

Popslze 

n ox t 

Performance of 60 runs 

Avg. Trials 

Success 

Prem. conv. 

1-pt 

(0,1) 

100 

20 

45 


817,8 




25 

32 

18 

762.6 

SBX 

(0,1) 

100 

0 

8 

0(42) 

450.0 




2 

40 

6(4) 

1117.6 




5 

50 

0 

730.0 




10 

50 

0 

688.0 




20 

47 

3 

663.8 

BLX-0.5 

(0,1) 

100 


4 

46(1) 

1575.0 


3.6 Pole problem 

The pole problem is a two variable function having four majdmum points, of which one is 
the global maximum. The function is hypothetically constituted to have a light pole at each 
ma.ximum point with certain illumination capability. The illumination is maximum at the pole 
and diminishes away from the pole. When multiple poles are present in the search space, the 
mixed illumination results. For a given distribution of illumination levels in the search space, 
the objective of the problem is to find the point with maximum illumination. The problem 
can be written mathematically as follows (fig. 3.8): 

M«dmke + 

where (a:,',!/;) represents the coordinates of the poles, c,- represents the intensities of light 
at the i-th pole, and /ij represents the height of the t-th pole. In the parlance of optimization, 
the parameter, {xiiyi) represents the t-th optimum point, c,//tj signifies the function value at 
the t-th peak, and hi signifies basin of attraction of the t-th peak. In the above problem, there 
are four peaks with the following parameter values: 



i 


1 

2 

3 

4 

Xi 

0.4 

0.3 

0.7 

0.8 

Vi 

0.3 

0.7 

0.2 

0.8 

Ci 

1.0 

1.0 

1.0 

1.125 

hi 

0.1 

0.1 

0.1 

0.075. 


The optimization problem has a global maximum point at x* — (0.8, 0.8). Table 3.8 shows 
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250 



Figure 3.8: Function /$ (pole problem). 


the performance of single-point, SBX, and BLX on this problem for a termination margin 
€ = 10 - 3 . 

We observe that real-coded GAs with the SBX operator (for 7i between 2 to 5) perform as 
good as the binary-coded GAs with the single-point crossover. However, real-coded GAs with 
BLX-0.5 do not perform as well as other two GAs. 

Prom the above simulations, we observe that the BLX-0.5 operator performs well only 
on well-behaved unimodal functions having a continuous search space, but does not perform 
as well in problems with multiple optimum points. Moreover, the focus in this thesis is to 
compare the SBX operator used in real-coded GAs with the single-point crossover operator 
used in binary-coded GAs. Thus, in subsequent simulations, we compare single-point and SBX 
operators only. 


3.7 De Jong’s functions 

All five Do Jong’s test functions (De Jong, 1975) are tried next. The simulation results for the 
single-point and the SBX operators are tabulated in Table 3.9 through table 3.13. 

Function I (fig. 3.9) is a three-variable unimodal function having a minimum point at 
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"I'ablo 3.8: Simulation results of GAs with different crossover operators on the function /s are 
tabulated. The number inside brackets in column five indicates the number of times the GA 
did not converge to any point at all. 


Operator 

Popsize 

n or d 

Performance of 10 runs 


Success 

Prem. conv. 

Avg. Trials 

1-pt 

200 

20 

6 

4 

3666.7 



30 

9 

1 

3977.8 



40 

8 

2 

2975.0 

SBX 

200 

0 

0 

9(1) 




1 

A 

6 

3,700.0 



2 

8 

2 

3,375.0 



3 

9 

1 

2,866.7 



4 

8 

2 

3,300.0 



5 

9 

1 

3,200.0 

BLX-0.5 

200 


3 

7 

2,933.3 


(0,0, Of : 

3 

hixi) = 53 < 5.12 (3.6) 

t=i ' 

In real-coded GAs with the SBX operator, points outside the specified range may also be 
created. Thus, we assume that the function is also defined outside the above range, but 
the initial population is created in the above range. The simulation results (tabulated in 
Table 3.9) show that GAs with SBX are able to find the minimum point with two decimal 
places of accuracy in all simulations for small values of n. The binary-coded GAs with the 
single-point crossover find the minimum point in only 60% of the simulations. The results for 
SBX is expected because the function is unimodal and continuous in the real searcli space. 
In binary-coded GAs, however, the minimum point falls in a Hamming cliff, which causes 
difficulty to solve the problem in 100% simulations. 

Function II (fig. 3.10) is a two-dimensional minimization problem with the global minimum 

at (1, 1)^ : 

/ 7 (a:i,a: 2 ) = 100(a:? -a: 2 )^-l-(l-a:i)^ - 2.048 < xi, aia < 2.048 (3.7) 

For real-coded GAs, the initial population is created in the specified range. The results in 
Table 3.10 shows that both GAs can not find the optimum point (1, 1)^. In this function, the 
slope towards the optimum point is very small and the population gets converged to a point 
along the ridge. When the SBX operator is performed along a line joining the two parents 
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FI 

Figure 3.9: Inverted, two-dimensional version of De Jong’s (1975) test function FI. 

(approach 3 of section 2.4.4 of previous chapter), near 100% convergence is observed for n = 1 
to 2. 

Function-Ill (fig. 3.11) is a five- dimensional stair-case like function where the function value 

is monotonically nondecreasing as the variables are increased : 

6 

fai^i) = integer (a;,) - 5.12 <»,■< 5.12 (3.8) 

i=l 

Since the SBX operator may create children points outside the specified range, we have 
modified the original function by taking the absolute value of the function creating the initial 
population in the range 0 < x; < 10.24. This results in a minimum point at x,- = 0. In order 
to make the comparison fair, binary GAs are also run on this modified function in the range 
0 < xi < 10.24. Both GAs perform equally well on this function. It is interesting to note that 
despite the discontinuity of the function at many points in the search space, the real-coded 
GAs with SBX are able to find the true optimum. This is because the function is monotonically 
nondecrcasing towards the optimum point and because the population- approach of GAs does 
not allow them to get stuck at any of the plateau. 

Function-IV (fig. 3.12) is a 30-dimensional unimodal function with a zero-mean Gaussian 
noise : 

30 

M^i) = E ix f + Gauss (0, 1) - 1.28 < x, < 1.28 (3.9) 

t=i 

The optimum point of the deterministic function is x, = 0 (i = 1 to 30) with a function value 
equal to zero. But due to the addition of the Gaussian noise, the optimum point and hence the 
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'r;il)l(> ;!.9: 


SimiilaUoii roKiiUK of (_JAk 


with (liUcreul crossover operators on De Jong’s functioii-L 


llange 

Tolerance 

Operator 

Popsize 

n or £ 

Perform, of 50 runs 

Avg. Trials 

Success 

Premature 

conv. 

(-5.12,5.12) 

10-2 

1-pt 

100 

30 

25 

25 

2532.0 





45 

26 

24 

2500.0 





GO 

19 

31 

3254.8 



SBX 

100 

0 

50 

0 

2556.0 





1 

50 

0 

2124.0 





2 

50 

0 

2100.0 





3 

50 

0 

2124.0 





5 

48 

0 

2183.3 





10 

24 

2G 

2420.8 


'■J’ahle .'3.10: Sirrnilatiori results of GAs with difrcrcut crossover operators on De Jong’s function- 
11 are tabulated. The number inside brackets in column seven indicates the number of times 
the GA did not converge to any point at all. 


Approach of 
multi'Var. 

Tolerance 

Operator 

Popsize 

n or £ 

Perform, of 60 runs 

Avg. Trials 

Success 

Premature 

conv. 


10"^ 

1-pt 

100 

24 

0 

50 






30 

0 

50 






40 

0 

50 


Approach- 2 


SBX 

100 

0-20 

0 

50 


Approach-3 


SBX 

100 

0 

40 

0(10) 

4400.0 





1 

50 

0 

3334.0 





2 

48 

2 

2920.8 





3 

41 

9 

2770.7 
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Figure 3.10 : Inverted, De Jong’s (1975) test function F2. 

optimum function value changes. To decide the convergence to the optimum, we have assumed 
a target function value of /x = -3.0. If any point has a function value smaller than /r, the 
simulation is terminated, A termination margin of e = 0.16 is used. Real-coded GAs with the 
SBX operator perform much better than the binary-coded GAs for n = 2 to 5. 

Function-V (hg. 3.13) is a 2-dimensional, multimodal function having 25 minimum points, 
of which only one is the global minimum. The basin of attraction of the global minimum point 
is very narrow, thereby making it difficult for the classical optimization methods to solve the 



1 

I 


I 


Table 3.11: Simulation results of GAs with different crossover operators on De Jong’s function- 
ill. 


Range 

Tolerance 

Operator 

Popsize 

nor i 

Perform, of 50 runs 

Avg. Trials 

Success 

Premature 

conv. 

(-5.12,5.12) 

1.0 

1-pt 

100 

20 

50 

0 

560.0 





40 

50 

0 

696.0 





50 

50 

0 

720.0 



SBX 

100 

0 

50 

0 

1256.0 





1 

50 

0 

S92.0 





3 

50 

0 

748.0 





5 

50 

0 

722.0 


.tIMTRAL LJBRAm 


1 <. T., KANPUR 

A. 
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Figure 3.11: ItivorUul, Iwo-diiiionfiiotia! vo.rfiioti of Do .loiig’H (1975) tost, function F3. 
problem to global optimality : 

^ 1 

fto(xi) = 0.002 -I- T— 2 , ^ ^ ^ 

j=i J + Z^«=:n®« ~ “b'j 

Minary-c.odcd CAs witli the single-point crossover perform better on this function than real- 
coded CAs with the SBX operator. This is so because the location of the optimum points are 
such that the binary coding favors the search process. When the locations of 25 optima are 
randomly placed, binary-coded GAs succeed in only 64% of the simulations, whereas real-coded 
GAs with SBX succeed in 54% of the simulations. 

From these simulations we conclude that real-coded GAs with SBX has performed either 
better than or similar to binary-coded GAs in De Jong’s test functions. In order to bolster our 
conclusions further, we apply SBX operator to two more functions. 



3 

i 


3.8 Rastrigin’s function 


Rastrigin’s function (fig. 3.14) was used by a number of GA researchers in the recent past 
(Muhlcnbein, Schomisch, and Born, 1991; Gordon and Whitley, 1993). The function has 20 
variables and contains 10^° local minimum points, of which only one is the global minimum 
point: 


20 

f\i{x) = 200 + - 10cos(27ra;i), -5.12 < a:,' < 5.12. 


•=i 


' i 

h 1 


.I- ‘ 



(3.11) 
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Table 3.12: Simulation results of GAs with different crossover operators on De Jong’s function- 
IV are tabulated. The number inside brackets in column six indicates the number of times 
the GA did not converge to any point at all. 


Tolerance 

Operator 

Popsize 

n or £ 

Perform, of 50 runs 

Avg. Trials 

Success 

Premature 

COttV. 

0.16 

1-pt 

100 

240 

4 

42(4) 

15200.0 


SBX 

100 

0 

0 

0(50) 





1 

0 

0(50) 





2 

50 

0 

17032.0 




3 

50 

0 

12464.0 




4 

50 

0 

10274.0 




5 

50 

0 

9372.0 


Table 3.13: Simulation results of GAs with different crossover operators on De Jong’s function- 
V are tabulated. The number inside brackets in column seven indicates the number of times 
the GA did not converge to any point at all. 


Tmax 

Tolerance 

Operator 

Popsize 

n OT i 

Perform 

of 50 runs 

Avg. Trials 

Success 

Premature 

conv. 

500 

0.1 

1-pt 

100 

■SI 

48 

2 

756.2 






50 







40 

49 

1 

■IH 



SBX 


0 


2(41) 






1 








3 


17(2) 






5 

36 

14 






8 

39 

11 

892.3 





10 

37 

13 

891.9 
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Figure 3.12: Inverted, two-dimensional version of De Jong’s (1975) test function F4. 

The global minimum point is at a:,- = 0 for alH = 1,2,..., 20. At this point, the function 
value is zero. The other local minima occur when the cosine term is one. This function is a 
diflicult optimization problem, because there are 2^“ different local optimal points (surrounding 
the global optimum) with function values very close to that at the global optimal point. Since 
it is relatively a more complex problem, we allow maximum number of generations Tmax equal 
to 500. We set c = 0.1 and the target function value fx = 1.0. This allows a termination of 
the simulation when at least 19 out of 20 variables have converged to their global minimum 
values. 

Binary-coded GAs with equal substring length for all variables are applied first. In all 
ten simulations, binary-coded GAs prematurely converge to a wrong solution (Table 3.14). 
Real-coded GAs with the SBX operator perform better than binary-coded GAs. The average 
function values of the best solution in all simulations are in general better compared to that 
obtained for binary-coded GAs. The success of real-coded GAs in this problem is due to the 
controlled allocation of children points under the SBX operator. With moderate n, the children 
points are not far away from the parent points. So the search slowly progresses towards the 
optimum point. It is worth mentioning here that with a tolerance of 0.1 in each of 20 variables, 
a enumerative search method requires about [(5.12— (-5.12))/0.1]^° or 1.48(10'*°) function 
evaluations. Real-coded GAs took only a fraction of this search space to locate a near global 
minimum point. These results show the efficacy of real-coded GAs with the SBX operator in 
solving multi-variable, multi-modal functions. 




1 


t 

k 


\ 


I, 
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Figure 3.13 : Inverted, De Jong’s (1975) test function F5. 


3.9 Blocked function 


A two-dimensional blocked function (%. 3.15) is designed according to Goldberg’s (1991) 
suggestion. The mean (over ) and global slice (at «*) of the function are plotted in figure 3.16. 

The global maximum point lies at the point (0.4, 0.45)^. Two local hills are placed on 
either side of the global optimum. In the global slice (taken at Xi = 0.4), the global maximum 
point is surrounded by two local maximum points, but the average slice (averaged on xi values) 
is predominant by the local peaks. It is interesting to note that the search along the variable 
xi is unimodal. Thus, the population is expected to converge at xi = 0.4. Once most of the 
population converges on or near = 0.4, the search progresses along the global slice. Since the 
population is converged to the extreme local peaks, it becomes difficult for classical algorithms 
to find the global point. The function is given below: 


fnixi ,X2) = -{xx- OAf + 


+ r,{xi - 0.4)2 ^ (a;^ _ a) 


2 ‘ 


(3.12) 


Here, the following parameters axe used: 



* 



2 

3 

4 

5 

a; 

0.002 

0.0025 

0.014 

0.003 

0.0028 

bi 

0.002 

0.0020 

0.003 

0.001 

0.0010 

a 

0.1 

0.9 

0.45 

0.27 

0.65 


0 

0 

10 

10 

10 




Figure 3.14: Contour plot of 2-diineiisional version of function fn (llastrigin’s function). The 
central basin is the global basin. 


Situulatiori results with bin ary- coded and real-coded GAs are tabulated in Table 3.15. 
Although the binary-coded GAs with single-point crossover converge to the global optimum 
point (with three decimal places of accuracy) only 5 to 15 times, they converge to the global 
basin in about 70% of the simulations. Better results are observed with the real-coded GAs 
with SBX; in at least 80% of the runs, they have converged to the global maximum point. In 
all successful simulations of real-coded GAs, it is observed that the variable a;i converges to the 
true optimal value first, as predicted by Goldberg (1991). Subsequently, the search along the 
global slice continues and finally, finds the global maximum point. Whenever real-coded GAs 
have failed to find the global optimum point, the convergence to the variable X 2 has occurred 
first. When this happens, the population shifts in any one or both the peaks shown on the mean 
slice plot in figure 3.16. Since the function on xt is unimodal with maximum at = 0.4, the 
solution converges to a wrong solution. Although the SBX operator works on real parameter 
values, its search power is similar to that in the single-point crossover. With a moderate n, the 
population located at two extreme optima improves gradually and can overcome the barrier of 
the local peaks to finally converge to the global optimum. 




Table 3.14: Simulation results of GAs with different 
are tabulated. 'Die mittiljer inside lirac.kets in (•.oliimn 
GA did not converge to any point at all. 


crossover operators on the function /n 
five indicates the number of limes the 


Operator 

Popsize 

n or E 

Performance of 10 runs 

Avg. Trials 

Avg. function 
value 

Snccc.ss 

Prcin. conv. 

11 in ary 

400 

200 

0 

10 


11.32 



300 

0 

10 



SBX 

400 

0 

0 

0(10) 


224.5 



2 

10 

0 

49,080.0 

6.81 



3 

8 

2 

51,750.0 

2.92 



5 

6 

4 

39,866.7 

2.39 



10 

10 

0 

28,000.0 

3.77 



15 

10 

0 

27,280.0 

3.15 



20 

10 

0 

23,360.0 

3.08 


Table 3.15: Simulation results of GAs with different crossover operators on the function /12 
are tabulated. The quantity in bracket indicates in column five the number of times GAs have 
converged to the global basin, but not to the global optimum. 


Operator 

Popsize 

n OT (■ 

Performance of 50 runs 


Success 

Prem. conv. 

Avg. Trials 

Binary 

100 


5 

1 i 

1,480.0 



mm 

15 

1 f 

1,753.3 



40 

10 


1,550.0 



50 

6 


1,616.7 

SBX 

100 

0 

42 

wmtM 

2,323.8 



'1 

43 


1,927.9 



2 

47 


1,872.3 



3 

43 


1,795.3 



5 

40 


1,677.5 



10 

38 


3,091.7 
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0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 


X2 

Figure 3.16: The mean slice and global slice (at a:*) of tbe function /12 are shown. The average 
function value over the search space is 0.287. 
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3.10 Summary 


This cliaptor presenlccl the siuuilatiou results of real-coded GA using SBX vs binary crossover 
on a number of teat functions including De Jong’s functions. The results obtained here show 
tliat SBX lias performed eitlier l)etter or similar to binary crossover in all test functions. It 


has been found particularly useful in problems, where location of global solution is not known 
beforeliand. h’lirtlier, SBX lias l)een ap|)lie<l HiiccesHrully to solve a blocked function introduced 


by Goldberg (1991). These results present a confidence, with which we begin at applying SBX 
in an application problem in the next chapter. 



Chapter 4 


Ambiguous Shape Modelling : 
Application of GAs 


This chapter addresses to the second part of the thesis, i.e. ambiguous shape modelling. Here 
wo discuss the problem and its signUicauco in CAD systems. It is also shown how GA with 
SUX is applied in this problem. 

4.1 The design process and present role of computers in de- 
sign 

DesigniR a widely used term to denote the formulation of a plan for the satisfaction of a human 
need. According to one definition (Peters et.al., 1990) ; 

Design is a creative operation of products, production processes or more generally 
of humanity serving systems using available materials, products and processes, 

aiming to achieve a specific function, responding to the demand of the customer and 

. ^ 

compromising between conflicting constraints as cost, delivery, delay and feasibility. 

Any design process normally goes through several phases, which can be broadly classified 
as (Saiulor, 1964): 

1. Confrontation with the problem 

2. Formulation of the problem 

3. Conceptual design of the system 

4. Synthesis 

5. Generation of analyzable model 

6. Experiments, analysis and optimization 

7. Presentation 
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It is to be noted that the above phases are not a linear step-by-step procednxe. In a real 
design process, these steps may be required to be repeated in any order, and the designer needs 
to go back and forth on these steps. 

In a real sense, the present CAD systems- are used only for drafting and not for designing 
(McCullough, 1990). That means that present computers systems can’t assist the designer in 
the conceptual design and synthesis process. The designer has to first formulate his entire plan 
and generate an analyzable model (step 5); only then the computer can be used to analyze the 
design, provide a visual display and optimize it. At this point, the design may already be too 
constrained so that conceptually different designs can not be represented in the search space 
and are thereby excluded. This is a serious problem in aU design optimization efforts. In the 
following sections, we focus our attention at the second phase of design (conceptual design) 
and show how the role of CAD can be extended to even this phase. 

4.2 Quantitative vs qualitative models 

Almost all the present CAD systems use quantitative models to represent object geometries, 
storing the exact information about its boundary and attributes. In fact, one of the well- 
established requirement of CAD models (Requicha, 1980) is “unambiguousness" , i.e. a single 
model should represent only a single geometry. However, there are other types of models 
possible — so-called qualitative models, which are less precise in nature. In a typical qualitative 
model, quantities are represented from a very coarse and functionally relevant discretization 
such !is ( — ,0,+), ms opposed to the full quantitative specification. This results in the i 
models being ambiguous^ i.e. a class of objects are represented by a single model. 

Qualitative models are important in a number of areas such as function modelling and 
abstraction, pattern recognition and conceptual design. In this work, we are interested in 
applying qualitative models in shape modelling. In the next section, we look at some basic 
techniques on which such a qualitative model can be built. 

4.3 Qualitative shape modelling 

As stated earlier, qualitative models use qualitative zones to represent parameters of the model. 
In this thesis, we limit our concern with models of 2-D shapes only. Even though qualitative 
models are beneficial in a number of areas, their applicability in geometrical applications has 
remained limited due to problems such as inability to reconstruct the shape for visual display, 
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a piol)lom Uiat can be Uirnicd an inexact, visualization (King aiui Mukorjeo, 1!)90). Since the 
model rcproKOiilH not, one but many shapes, the user is unable to visualize the shape being 
described. Hence an appropriate representation scheme should be used to base the qualitative 
model, such that It i,s possible to reconstruct the shape. A qualitative model can be built 
upon different types of representation principles. There are many popular shape modelling 
techniques (U.euiuicha, 1980) each based on a different principle : 

1. Boundary representation in terms of absolute co-ordinates of vertices (Absolute bound- 
ary) 

2. Boundary representation in terms of lengths of each edge and relative angles made with 
previous edges (Boundary clutin) 

3. Storing the pixels (bitmap) that constitute the shape (Uniform grid) 

4. Quadtree model (Hierarchical grid) 

f). Boolean operations among certain predefined primitives (Set theoretic) 

6. Medial Axis 'ftaiisform (Axis based) (King, 1991; Blum, 1967) 

It is to be noted that an appropriate representation should be invariant under geometric 
transformations, stable and robust. Rirther, the qualitative version of the model should be 
able to store the basic abstractions of the shape. At the same time, it should be recognizable, 
i.e. the system should be able to display at least some of the typical shapes represented by 
the model. Our objective here is to select a modelling technique in which we should be able 
to model a class of geometries by representing the various parameters associated with the 
model, into qualitative zones. All the modelling techniques described above except the last 
one (axis based) are found to be unstable and hence inappropriate. Medial axis transform has 
the unique property that it can be redefined to form a valid qualitative model. Recently, work 
has been done on recons timetable ambiguous shape modelling using qualitative medial axis 
transform approach (King and Mukerjee, 1990). However these results are limited to convex 
shapes only and do not present applicability for a general shape. Partly this limitation is 
due to the difficulty in solving the constraint satisfaction problem using traditional methods, 
which is why we have chosen to apply the new GA method to this problem. In this work, 
we concern ourselves only with developing a qualitative medial axis transform (QMAT) with 
reconstruction and visualization capabilities for handling general shapes including non-convex 
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Cq) (b) 

Figure 4.1: Vorouoi diagram for (a) point sites, (b) line sites. The thick lines in fig. (b) 
represent ' line sites. 

ones (Agrawal, Mukorjeo. and Deb, 1995). The key notion used here relate to that of line site 
voronoi diagram which is a topic for the next section. 

4.4 Voronoi diagram and medial axis transform (MAT) 

Voronoi diagrams have been studied extensively in computational geometry and are used in 
a range of a])plicatioiis including robotics, geography and facility planning. Given a set of N 
point sites Si in a plane, the voronoi diagram of this set is defined as the locus of a point 
which is equidistant to at least two nearest point sites. The voronoi diagram divides the plane 
into exactly N regions, each corresponding to one site. These regions have the property that 
the nearest neighbour of any point in the region Rp will be point site p, where Rp denotes 
the region corresponding to point site p. For example. Fig. 4.1(a) shows the voronoi diagram 
for some points. The voronoi diagram of point sites consist of only straight lines. K some of 
the sites are line segments, the V.D. will, in general, consist of lines and portions of parabolas 
(Fig. 4.1(b)) 

Medial axis transform (MAT) of a closed polygon is defined as that part of the voronoi 
diagram constructed by treating the edges of the polygon as line sites, that lies inside the 
polygon. It can also be regarded as the locus of the center of inscribed circles that touch at 
least two edges of a polygon. For all polygons, the MAT is a planar graph consisting of straight 
lines and parts of parabolas. Various algorithms for computing the MAT are available in the 
literature (Fortune, 1987; Yap, 1987; Tiwari, 1986), Some sample polygons together with their 



64 



Figure 4.2; Some sample polygons with their MATs. 


MAT are shown in (ig 4.2. 

It is to bo noted that mapping from polygon to MAT is one-to-one, i.e. given a MAT, its 
polygon can be generated nni<iuely and vice-vorsa. lloncc, tlio norma,! MAT can be used to 
roprosent the (piantitative or exact shape. A detailed description of this application can be 
found in (IJhim, 19G7). In this work, wo explore the issue of using MATs to represent inexact 
shapes. 


4.5 Hybrid qualitative medial axis transform 

An MAT is a planar graph which can be represented by a list of links which are segments 
along the MAT, or fragments of the line-site- voronoi-diagram. Each link is associated with the 
following information: 

• length 

• radius function; 

• angle with parent 

For example, fig. 4.3 shows a typical MAT. AB, BF, BC etc. are different links of MAT 
which are represented by their end points called nodes. Even though the radius function can he 
of any arbitrary form, wo assume linear interpolation function in the current study, thus it will 
be sufFicient to specify the radi only at nodes (start-node-radius, end-node-radius). Further, 
we define the longest link in the MAT to be wot link. The Z of a link is defined by the angle 
made by this link to it’s parent link. For example the angle of link BC is the LCBE. 
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Figure 4.3 : A typical medial axis transform. Note that radii at nodes D and F are zero. 

Our objective here is to use the MAT as the basic model on which to apply a qualitative 
mapping. The qualitative model will then relate ail the link lengths and angles using a 2D 
spatial reasoning model (Mukerjee and Joe, 1990). In the purely qualitative version of the 
MAT, the lengths of each link in the MAT, their relative angles and their radii are stored as 
qualitative zones from (-,0,+) and not as a single value. The problem with such a model 
is that the discretization, e.g. all negative numbers into a single class, is too coarse, and the 
resulting geometries are too diffuse to be useful for reconstructing or even usefully constraining 
the shape. For this reason we use a hybrid qualitative algebra (Mittal and Mukerjee, 1995) 
in which the qualitative regions are subdivided into regions that are at the right level of 
discretization for the task. Picking up values from these zones and constructing the polygons 
accordingly leads to generation of different but qualitatively similar shapes (fig. 4.10), If these 
uncertainity zones are reduced to a point, the model reduces to a quantitative model. On the 
other hand, widening these zones results in more diffuse geometries. Note that an appropriate 
level of discretization depends upon the stage of design; initially a designer will like to have too 
wide zones of uncertainity to test various possibilities, and as the design tends to be finalised, 
the level of discretization will be made finer and finer. 

In the qualitative version of the MAT, often some links which may not be so important, can 
get deleted (King, 1991). This usually results in a smoother and simpler shape (fig. 4.4(a)). 
Also to make the implementation simpler, we assume that all the parabolas of the MAT hai-e 
been approximated to straight line segments (fig. 4.4(b)). 
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'I/I: A pproxiiiuU/ioiiH in QMA'I' (a) l{.<Mnoval of nniini)()r(.aiit, links, (l>) Approxiiiialion 
of )):i,ral)olii.H l.o Hl,r!i,if>;li(, linns. 





Figure 4.6: A single link MAT becomes infeasible by link length being small. 

4.6 Feasibility and consistency of the QMAT 

The problem with having inexact zones for the MAT parameters is that these zones may be 
inconsistent among themselves, i.e. often a randomly generated shape by the model may be 
inconsistent or infeasible. An MAT is infeasible if no boundary can be constructed for these 
axis specifications. For example, in fig. 4.5 a single link has a very large radius on one side, 
and it would be impossible to define the boundary. Similarly, fig. 4.6 shows another infeasible 
case, wliere a part of l)oiindary is included in the circle at one end. Here also, the boundary is 
undefined. 

An MAT is said to be inconsistent if a boundary for the MAT can be generated but the 
MA4’ of this l)ounda,ry is different from input MAT. Consider, for example, the case shown in 
fig 4.7. Because of the nonconvexity of the shape, there will be some portions of parabolas in the 
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Figure 4.6. AiiotliGi infos-siblG bscs-uss of Gnd rs-di bGcoming too bigb. A psuxt of bouiidsiTy 
is included in the circle. 





Figure 4.7: An example of inconsistent MAT, (a) Input MAT (b) Reconstruction of contour (c) 
MAT of generated contour. Note that fig. (a) and (c) are different in the sense that regenerated 
MAT contains a part of parabola. 


MAT, but they have been, approximated by straight lines. Thus, a consistent boundary can’t 
be generated by this MAT, because the resulting polyarc would have an MAT that is different 
from that which was input. However, the parameters of the MAT being varied independently 
to each other, there is very little control on the contour and such inconsistencies are hard to 
detect. 

To eliminate such inconsistent / infeasible shapes and also to generate desirable shapes, 
we have used penalty functions coupled with real-coded GA using SBX. Each individual of 
GA represents an instance of shape generated by QMAT model. Thereafter, it is evaluated in 
terms of its fitness criteria. 

4.7 Reconstruction from the MAT 

Once a feasible MAT has been found, its corresponding boundary representation can be ob- 
tained by taking the envelope of the circles of appropriate radii. In our case, since these radii 
are linearly interpolated between the endpoints, this envelope is the tangent between the two 
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Figure 4.8: The reconstruction process, (a) The input MAT (b) Tangents on circles defined 
(c) Tangents trimmed (d) Final contour. 

endcircles. Essentially, the process of reconstruction start from the root link and then recur- 
sively traverses to eacli cliild link. For a particular link, the tangents on the circles at end 
nodes are calculated, Thereafter, arcs and tangents are trimed to generate the enevelope of 
all tangents and circles, to define the boundary. The process of reconstruction is illustrated in 
fig. 4.8. 


4.8 Fitness criteria for GA 

Any MAT generated from the class of a given hybrid QMAT can be evaluated with respect 
to its viability, measured in GA with the fitness criterion, which can be defined according to 
particvdar application requirements.. The objective then becomes to select a shape which is 
most desirable (corresponding to maximum fitness or minimum penalty). Since GAs use a 
population of solution points, it is possible to deal with many shapes simultaneously. Every 
random MAT generated by GA is evaluated and given a suitable penalty value according to 
its degree of inconsistency / infeasibility and some other user-defined criteria. Our present 
implementation shows the results of a three-link case, where MAT consists of three links 
meeting at a point. The longest link has been taken as the reference length, and the length 
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Figure 4.9: The b:usic geometry of a tliree-Iink MAT. The length of root has been taken as 
reference. 


ratios of tlie othtir two are tl and ( 2 . Similarly the radius ratios at the four nodes are rl through 
r4. The two aivgle.s made by £1 and £2 with the reference link are 91 and 92 (Figure 4.9). A 
qualitative version of this model will have all its parameter -values specified in an uncertainity 
zone, which are shown as hatched areas in fig. 4.10. This figure also shows some of the shapes 
of the class represented by this model. Note that specifying other uncertainity zones on the 
parameter values of this model will result in a different class of shapes. For example, another 
class of shapes is shown in lig. 4.11 which relate to the rocker arm used in I.C. engines. 

4.8.1 Penalties due to infeasibilities 

As discussed in previotis section, some MATs generated from the QMAT model may be in- 
feasible, Such instances can ho eliminated with a penalty function which increases with their 
degree of iiifcasiblUty. This penalty is usually of greater significance than penalties for con- 
tour properties. Minimization of summation of penalties in GAs guides the search to feasible 
shapes. We consider the two following cases as infeasible. 

1. Circle inclusion ; (Fig 4.7(a)). Tliis is an infeasible case, because the tangents on the 
circles are undefined. Hence, it must be discarded by GA. For this purpose, we ^ve a 
penalty value : 

Cl> = Cinf ^1.0 + £ < Tmox - r,nin 


Figure 4.10: Qualitative version of three-link MAT, together with some of the shapes generated 
by this model. Note the abstract similarity between the shapes. 



Figure 4.11: Some sample contours for an I.C. engine rocker arm. These were generated by 
the hybrid qualitative medial axis transform model. 
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where Tmax ^min refer to the bigger and smaller radius respectively and £ to the length 
of link. Cinf is the infeasibility coefficient setted by user (default value is 10.0). The above 
equation is chosen because it gives a non-zero monotonically increasing penalty with 
increasing infe<isibility (Fig. 4.12 a).We can use any other function with a monotonically 
decreasing penalty as we move towards the feasibility zone. 

2. Boundary inclusion : (Fig 4.7(b)). This is also an infeasible case, because the bound- 
ary of the contour is undefined at the inclusion point. This type of case is penalized by 
the following penalty function : 

t=i 

where (/)(i) is calculated w.r.t. to vertex of the boundary and the circle as below : 

p(t) = I ~ (^) 

( 0.0 otherwise. 

Hero di is the distance of vertex from the center of the circle and r is the radius 
of the circle. The coefficient P<.,v and Cdr can be set by user (default values are 20.0 
and 5.0). The penalty is a smooth function which increases as the infeasibility increases 
(Fig 4. 12 b). Note that the penalty function increases as the included portion of boundary 
comes more near to the centre of circle. The penalty is equal to zero, if none of the portion 
of boundary is included in any of the circles. 

4.8.2 Application-oriented penalties 

In addition to the penalties discussed above, a number of other properties may be relevant 
depending upon the application. The user is given the flexibility to set different weights of 
these penalties, according to bis requirements. Thus, if some weight is set to zero, the property 
corresponding to it gets effectively removed from consideration. This section illustrates some of 
the different contour properties that can he considered; note that the user can easily incorporate 
other functions of arbitrary complexity. 

1. Penetration of generating circles : Too much penetration of the circles leads to near- 
circle shapes and small changes in the radii may even result in infeasible (Fig 4.5). The 
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(q) 


(b) 


Figure 4.12: Poniilties for infocisible MATs, (a) for circle inclusion, (b) for boundary indusion. 

penetration criterion is modelled as a penalty function that increases as two circles on 
the same link penetrate each other: 

d) - ( for i <ri + r 2 
\ 2rmax ) 

where r„i„a, is the greater of (ri,r 2 ), and I is the distance between the two centers f li-nk 
length). 

2. Non-convexity ; In certain applications, non-convexity in the shape may be undesirable. 
The teat function measures non-convexity in terms of reflex angles. If the reflex angle 
is more than tt, we give a penalty equal to {angle — tt)^ else it is zero. This penalty is 
summed over all the reflex angles of the contour. Note that this is just one method of 
penalizing the non-convexity and there can be many possible methods to do this. 

3. Number of edges : In general, a simple shape (containing lesser number of edges) may 
be preferred over its more complex counterparts. To model this criterion, we generate 
a penalty function equal to fc X iV where fc is a normalization constant (we have used 
k =: 0.1) and N is the actual number of edges in the contour. 

4. Area of the polygon ; This may need to be either minimized or maximized. Area can be 
computed based on the final contour obtained from the MAT instance, and the penalty 
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Figure ^.13: Examples of (a) smooth shape (b) non-smooth shape, generated by same MAT, 
but with didercnt radii. 


set at wvifjhi. X area if minimizing (Weight is 
maximized), 


given a negative value if area is to be 


6. Suwalhncss : ,Sha,p(\s with many sudden changes in the angle may be less desirable. 
Fig d.l3 sliows exam[)les of smooth and non-smooth shapes. To model this criterion, we 
traverse the whole contour, keeping track of change in the orientation from one edge to 
other. The penalty function is a summation of squares of these angle changes. Thus 
smootli sha|)e.s (with le.sser change in angles) are given less penalty and sharp shapes are 
highly penalized. 

6. Aspect Ratio ; Often, as in the example of the knife in fig. 1.1 the aspect ratio of the 
sliape (lieight/width) inay be an important functional parameter required to fall within 
some pre, spec! lied range. The penalty function for this property first calculates the actual 
a,si)ect ratio of the shape generated, and if tlie a.r. is out of the specified range, it sets 
a penalty value e(|ual to scpiare of deviation from the bounds, otherwise no penalty is 
given. 'File a.spect ratio is calculated in the frame dolined by the longest axis, i.e. the 
width along the longe.st axi.s link, and height along a perpendicular direction. 

7. Parallelism of edges : In some applications (especially engineering applications and other 
artifacts) some edges may be exactly parallel. If this penalty function is invoked, it checks 
the angle between the orientation of two tangents of a link. Then the penalty is calculated 
as follows where 6 i.s the angle between the two edges: 

( CpO if 0.0<^<ai, 

(j){i) = < Cptti if oi < < «2, 

0.0 othorwi,se. 
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Figure 4.M: PenaUy fuiirtion for (a) parallelism (b) perpendicularity, x-axis on both figures 
represent.s the angle 0 between two edges being compared. 

The coefficient C,, used in this and following equation is a slope factor (we have taken it 
as 5.0). Angle! constants cri and tt 2 are some predefined threshold values (we have used 
0.2 and 7r/4 respectively). This penalty is as shown in fig. 4.14(a). The effect of this 
penalty is that there is no discriitiination among shapes having 0 > qi as far as 
this penalty is concerned. This is done because if the angle between the tangents is more 
than the threshold value, they can’t be termed as “nearly” parallel, and are therefore, 
equally good. 

8. Perj)(mdu:xdarity of (’diieH : Similar to previous one, some applications may need an exact 
perpendicularity between two neighbouring edges. Calculation of this penalty is similar 
to the previous one, except that the ranges for B have been changed. 

[ Cp (7r/2 - e) if 7r/2 - ai < 0 < 7r/2, 
f/i(0 = S 7r/2-a2 < ^ <7r/2-ai, 

I 0.0 otherwise. 

This penalty is «U 5 shown in fig. 4.14(b). Similar to previous one, this penalty function 
treats all those cases same which have angle d less than threshold value, since all such 
cases fall out side the definition of “nearly” perpendicular. 

In the reconstruction process of a given QMAT, some axis segments may have been deleted, 
resulting in uncertainity about the edge profiles there (see fig. 4.4(a)). In the GA, this can he 
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Figure '1.15: I/menriziiig an arc (a) wiUi cxUaidod 


taagonis, (h) with truncated tangents. 


adjusted hy tlin u.ser, who can s|)ocify how the arcs of the contour will be displayed. These 
arcs may emerge l)e<'au,s(> of nmioval of some links, and/or having a non-zero radius for an end 
node. The arcs may remain arcs or they can he converted into line segments depending upon 
following two parameters : 

1. Fxtemh'd tangent limmrity : If tins is set ecpial to 1.0, all the arcs having sweep angle less 
than 7r/2) arc' converted into two ('xtended tangents. This is shown in fig Tl,5(a). 


2. Truncated tangent linearity : If this is set eciual to 1.0, all the arcs having sweep angle 
less tl\an 0Oi(’= tt) hut greater than (”)t/(= 7r/2) are converted into three line segments, 
the middle one being the tangent at mid point of the arc. This Is shown in fig 4.15(b). 


If the.se degrec^H are set to ().(), all arcs remain arcs. Any value of these parameters in the 
range (0,1) lea<I.s to some of the arcs being converted to lines depending upon their sweep 
angles. It results in a more angular look to the shape. 

In some sen.se, non-conve^xity in the contour dictates inaccuracy in the input MAT, be- 
cause MAT of such a coiiLoiir will have, in general, some portion of parabola. The present 
implementation consists of only straight lines. We are allowing this approximation to make 
the imi)lementation simple'. 


4.9 llesuKis 

As discussed earllc'r, we leave used atJA approacli to fimi a (lesiral)le sliapo from QMAl moded. 
Tlie objective function fur minimization is tlie summed penalty functions discussed in ptcvioiis 
section. We liave used (lA witli SIJX cro.ssover operator with binary tournament selection. 
For testing purpose, wo have delined some test functions according to piovious section and 
generated .some shapes, We have used following parameter values which were kept constant 
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for ail nnus’. 

1. |'()|)iilii(,iuii ; .'■)() 

2. (Jro.ss ovc'f pr(il)al)ili(.y : 1.0 

3. Mutation prol)ability : 0.01 

4. Max. immixu' of p;(’iioratioii.s ; ■'(() 

5. l';xpon(uit(n) : 2.0 (Fixod throughout) 

In tho followiiiR to.st ca.st’.s tlu' UHor h a-ssuinod to havo provided ranges of variability as.so- 
ciatod with on, eh para, meter in tlie, model. Thus, by fixing the angle between two links but by 
allowing some variation in the radii, he/she can emphasize the functional relevance of the angle 
while stating tha,t tlu' a<-l,ua,l shape and other dimensions arc not as important. Furthermore, 
the, ii.ser can Hpeeil'y tlx' optimizittioii criteria to be used by the GA (by setting appropriate 
woigiit.s to (uu'.li pmialty riinciion.s) and after viewing the results, he/she may change either the 
original input or the litne.s.s criteria. To illinslrato this procedure, we will trace the development 
of several theim'.s a li.sh like .shape, an b-.shajied bracket and a long forking building. In each 
of tlKwe in.starice.s, a .set of (JMAT data i.s provided by tho inser (Table 4.1), and we show the 
result, s obtained by adjii.sting tli<' litne.ss criteria. In each instance, the numbers in parenthesis 
represent the range of value.s possible in the IIQMAT at tlic desired level of inexactiio.s.s. 


' 1 'a, b 1 (' 4 . 1 : M /VF^ data used for example shapes 


Vai'ia])i(' 

Fish 

1) racket 

Building 

7'1 

(()..5,(1.H) 

(0.0,0.5) 

(0.1, 0.3) 

r‘l 

(().(), ().:i) 

(0.2, 0.2) 

(0.1, 0.3) 

r;i 

‘ (0,(),{).;j) 

(0.0, 0.f)) 

(0.1, 0.3) 

)'4 

((i.(),().:i) 

(0.0, 0.0) 

(0.1, 0.3) 

fl 

(0.7J).7) 

(0.3,1 -0) 

(0.3, 1.0) 

n 

(0.7, 0.7) 

(0.1, 0.5) 

(0.3,1.0) 

(n 

(db,!)!)) 

(90,90) 

(45,45) 

02 

("HO.-d,')) 

(-45,-45) 

(-45,-45) 

Fitnes.s (Jrito.rioii 



Non-convexity 

smoothness 

parallelism 
porpondicniarism 
aspect ratio 

parallelism 
perpendicularisin 
a, sped ratio 
area 


'llc)W(!V(M', it may Im a good idea to cliaiige these pariuue-ter 
of GA runs in this particular application. 


valiKM ami investigate its cICccls on performauce 




Figure 4.16: The final fish-shape obtained by GA. 

4.9.1 liiological shape (fish) 

Here the user in interested in a bulbous shape for the head area and a not-too-angular tail. 
Sinoothne.ss in the final shape is desirable since biological shapes are smooth. Also, non- 
convexity has been used here to reduce sharp angles in the tail (non-convexity increase with 
the reflex angles). The head radius is significantly larger than the other radii, and linearity is 
not desirable since the shape is to remain rounded. A class of fish shapes generated by these 
values were shown in fig. 4.10, while the final version of shape is as depicted in lig. 4.16. 

4.9.2 Bracket cros,s-Section design 

Hero we are interested in designing a bracket to support a shelf, say. The shape of it is rouglily 
like an inverted “L”. Fig. 4.17 shows some possible shapes which are obtained by using the 
parallelism and por])cudiculari8m constraints and the aspect ratio (how wide is the shelf vs. 
how inncli wall support is desirable). 

One point to be noted hcTo is tliat in a conceptual design system dealing with such objects 
the user may be allowed to add other functional constraints, such as the strength constraint 
rejiresenting llie load to be carried by the wall and the strength of the walls to support it. These 
functional specidcations can bo automatically optimized along with the shape in obtaining valid 
designs (Mukerjee and Srivastava, 1994). 

4.9.3 Building design 

Consider the ground plan of a building dial, is to have a long central part, with two branching 
corridors at one end. In this example, the relative lengths of the corridors are controlled by 
the aspect ratio parameter, and the widths by the area parameter (fig. 4.18). All test cases 
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Figure 4.17; Tlie bracket cross-section is an L. (a) . . .(e) shapes from early generations of GA 
run. (f) final shape with aspect ratio = 0.8 (g) shape with a.r. = 0.5 (h) shape with a.r. = 
0.5 (linearity turned off) (i) shape with a.r. = 0.8 (linearity turned ofi). 

use parallelism and perpendicularism. The degree of linearity is kept 1.0 to allow linearized 
shapes. 


4.9.4 Some more test cases 

In order to bolster further our claims, we have generated few more test cases. Fig. 4.19 shows 
a test case where area minimization alone was used as GA fitness criterion. The same input 
model wa.s used to generate another test case with area maximization as the fitness criterion. 
Further, we claim that our three-link MAT model itself is able to represent a wide variety of 
shapes. Some of these can bo seen in fig. 4.21. However, it is worth mentioning here that a 
general shape may have an MAT consisting of more than three-link. Qualitative models for a 
generalized case can be developed and tested along same lines. 

4.10 Summary 

In this work we have illustrated an innovative approach for representing ambiguous or ill- 
defined geometries. The model shown here uses as input an axial model of the shape. 1 he ba,sic 
technique has a number of possible applications — conceptual design visualization, models for 




Figure 4.18: Sliapc.s lor forking building, (a) • • .(c) Shapes from early generations of GA run. 
(d) final shape for aspect; ratio =1.0 together with area maximization constraint, (e) shape 
for a.r. = 0.5 with area luiuimization (f) shape for a.r. = 1.0 with area minimization. 



Figure 4.19: A test case for area iriinimization. First three ligures are early samples of GA 
run. d’lic last sliape is optimized. Note that the mscr has defined the nncertainity zones in a 
manner that no better shape (with lessor area) is possible. 



Figure 4.20; A test case for area maximization. First three figures are early samples of GA 
run. The last shape is optimized, Note, that the user has defined the uncertainity zones in a 
manner that no better shape (with more area) is possible. 
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Figiiro 4. ‘21: A vaiidliy of sliapo.fl can be represented by three-link MAT. (a) An arm shape (b) 
A bracket shape (c) A triangle (d) An arrow (e) A pistol shape (f) A dumbell shape. 


computer vision, a,nd any other application involving sketch work or where shapes arc to be 
visualized. 'I’lie genetic, algorithms approach used here provides a mechanism for searching 
non-monotonically in an extremely nonlinear domain. Also, by being able to visualize samples 
from all generations, the system also provides the user with some idea of the variations possible 
on the basic shape constraints. 



Chapter 5 

Extensions 


The study inade in this thesis opens a number of ways for possible extensions. We discuss the 
extensions ofCA part and MA'r i)art separately. 

5.1 Extensions (real-coded GAs) 

1. Other tcchniqncfi for honiulcd ticarch space 

In the j) resent study, we have either allowed the children points to lie anywhere between 
?iegative inlinity to positive infinity, or have restricted them to remain within boiiirded 
search space using scaling tocliniquo (see section 2.^1. 3 in chapter 2). However, This scaling 
creates an artificial bias of the children points towards middle of the search space. It 
would be interesting to investigate this phenomena in more detail in binary GA, and 
then simulate that in SIIX. 

2. Study of effect of exponent n on OA pcrforimne.e 

In this thesis, we have chosen some values of exponent n and reported the results of GA 
perfonnauce. However, we feci that this is an important parameter in terms of search 
[>ower of crossover. A low value of n signifies broad search and a high value a local search. 
It may bo interesting to carry out a parametric study on GA performance with respect 
to n. This oxj)onont can also be varied during the GA search, namely a low value in 
early generations, and a high value for later generations. 

3. Selection of other form of probability distributions for contracting and expanding crossovers 

The probability distribution for contracting crossover used in this thesis is a polynomial 
function to the spread factor. This distribution is used to simplify the computations 


81 



82 


required in each crossover operation. Other functional forms of the probability distribu- 
tions can iilso be used, but care must be taken to choose a distribution which is sirmlarin 
form to that given in figure 2.6. In this respect, a Gaussian-like distribution with mean 
at fi = I and variance (o"^) chosen to have at most ±3 ct points within a specified bormd 
can also be used. 

4. Real-coded mutation operator 

In this thesis, our main thrust was on investigation of the effect of the crossover operator 
mainly. Mutation operator was developed (see section 2.5) in chapter 2 but its effects are 
not investigated. A future study may incorporate investigation of effect of mutation and 
crossover both used together. Moreover, there can be other forms of mutation operator 
than that used in tills thesis. However, any such form should have the property that near 
points should be more probable to be generated than far-off points. 

5. Application of SEX to real-world optimization problems 

The present study shows that real-coded GAs are preferred to binary-coded GAs for 
problems with continuous search space and for problems where arbitrary precision is de- 
sired. For example, we have used real-coded GAs with SBX in a robotics path planning 
problem (Mnkorjee, Sharma, and Agrawal, 1995). Many engineering design optimization 
problems contain mixed variables — both discrete and continuous. The classical opti- 
mization methods do not perform very well in solving these kinds of problems (known as 
mixed integer programming problems). GAs with a combination of both binary coding 
(for discrete variables) and real coding (for continuous variables) can be used to solve 
these problems efliciontly. For example, SBX is currently being used in optimization of 
a structural truss (Chaturvedi, 1995). Traditionally, a truss is optimized in two phases. 
First, a fixed values of areas for its members is assumed and then its topology is op- 
timized. After suitable topologies have been found, optimization techniques are used 
to decide the areas of its individual members. But the optimization in two stages does 
not guarantee to lead to global optimal solution. Hence topology and areas should be 
optimized simultaneously. Thus, it becomes a mixed variable problem, because areas are 
continuous variables and topology can be represented by discrete variables. SBX with 
single point binary crossover is being used for this problem. The crossover operation is 
performed as follows. A variable is first chosen for crossover. H a discrete variable needs 
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to 1,0 0 ,' 0 , 00 < 1 , tl,o «inRlo.,,„i„l C.roso„,cr i. „,od to tho bits reprosentiog that variable 
only. Hill if a eoliliiiilmis variablo iiootls lo be crossed, the SBX operator is used. This 
way Uin msiilUiig , string al.s<j bccoiiK's a valid striiiR, 


(), Siinulatimj other hinary crossovers for real-coded GAs 


'Die SnX operalior (tevelu|)ed 


hen? is based on the probability distribution of children 


strings ill a single-point crossover. Similar probability distributions (a function of the 
spread factor /f) can also be obtained for multi-point and uniform crossovers and other 
similar real-coded crossovers can be designed. Probability distributions similar to the 


one (levelot)e(I here are expected, it may be interesting to investigate whether they turn 
out otherwise. Depending on the outcome of that analysis, the issue of superiority of one 
crossover ovi'r otlu'r can be resolved, particularly in the context of GA’s application to 
continuous variabli's. 


7. lh>piilation siziny and ('onvcryc.nr.c. proof 


Throughout in this study, reasonable values of the population size are used. As demon- 
strated elsewhere (Goldberg, Deb, and Clark, 1992), the proper population sizing is 
important for succi'ssful working of GAs. That study also suggested a population sizing 
based on the ratio of the did'erence in detectable function values to the variance of the 
function. An attempt can he nuule to derive a population sizing for real-coded (!As from 
a similar viewpoint. In binary-coded GAs, the major difficulty towards achieving a con- 
vergence proof lies in the mechanics of the crossover operator. Although the mechanics 
of tin? single-point crossover (or for that matter any other binary crossover operator) 
is simple, their mallie.matical modeling becomes difficult. Any convergence proof of a 
stocliivstic. process involves calculation of transition probabilities. Since in the SllX op- 
erator, a direct probability distribution of children points for any given pair of parent 
points i.s used, the proof of convergence of genetic algorithms may become easier. 


8. Mxillimodal and mulLiobjeA:tix)c opUxxiizalion usixig real-coded GAs 

I'lu? success of l)iiiary GAs in many single-objective, unimodal problems has led re- 
searchers to extend (lA’s applicability to other types of optimization problems-multimodal 
and multiohjective function optimization problems. In order to lind multiple optimal so- 
lutions simultaneously, sharing (mictions are used to create artificial fitness values, which 
are used in the r(?pro(lucliou operator (Deb, JDH!); CJotdberg and Richardson, 1987). In 
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order 1.) find nn.lUpIn pareto optuaal points for multiobjective functior, optimization, 
tim coticopt of Hondo, nination is used to create artificial fitness values, which are again 
used in tl,(' reproduction operator {Srinivas and Deb, in press). Since only reproduction 
operator needs modification to solve tlie above problems, similar reproduction tecluiiques 
can be used along with the SllX operator used in this study to investigate the efficacy 
of real-coded (Ms in multimodal and multiobjective problems defined on a continuous 
search S{)ar:e. 


5.2 Extensions (ainbiguous shape modelling) 

1. Model for n-liiik ease 

The present implementation uses MA'I' composed of 3-link as the basic input. In real- 
world applications, shape.s may he much more complicated involving large number of 
links in their MATs. Thus, a generalization of the present implementation is needed 
wliere tm)(l('l should l)e able to handle ii-link MAT. It is suggested that the routines 
developed in the present implementation can be used as basic computation systems to 
be a,pplied a,t each nodi; of n-link MAT. 


2. MAT model fivm a ‘sketeh’ of the. shape 

Right now, the us(U' has to input the model of QMAT specifying the uncertainities for 
each param(U.er. l'lv(m though this was satisfactory for some test cases as shown in the 
the.sis, it will be inconvenient for reai-world applications involving complicated shapes 
Thus, it is sugge.sted that the MA'I' information can be obtained from an input ‘sketch’, 
using a MA'I’-calciilation system based on any standard algorithm for creating voronoi 
diagram of line sites (fortune, 1987; 'I'iwari, 1980; Yap, 1987). Thus the user won’t need 
to even think of MAT at all. Instead of that, he will specify some initial rough sketch of 
shape, Hjxu'.ifyiiig his degree of iincertalnity. Based on this input, the model should be 
able to find out ti\e MA'F and its (lualitativc version, and feed it to tlic main program to 
generat(? a set of qualitative similar s. 


3. Enhanced user inle.rface. 

A good userrinterface is lacking in tliis work. The main CM program runs on UNIX 
based systems and tlie input data has to be given through keyboard/file. ^.I’he graphi 
display program runs on MS-DOS platform, and uses the datafiles generated by the 
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main piogram. Tim future iiupleumiitation should integrate all the subsystems of the 
model ou a single plateform. The user must be able to see various possible alternative 
shapes dynamically. Then lie/slm cati si.ecify (po.ssil)|y using a mouse) which one is most 
suitable for his application and the program should now take this shape as the input for 
next iteration. 

4. Oilier JiliK'fiN criteria 

In tlie present implementation, we. have kept a limited number of litness criteria for 
the sake of simplicity. There arc, of course, endless number of contour properties that 
could have been tackled by CA, for example convexity ratio ( area of polygon/ area of 
it’s convex hull), first and second moment of inertias, perimeter distance, noise (small 
Hhari) Corimr('(i edges) and other simitar i.ssues. Moreover, in the real-world applications, 
the list of lltnoHH criterion may bo much more than this one, because it will include 
some application-oriented criterion ahso, eg. strength of the part under certain load 
conditiona, manufacturability consideration etc. If future implementation is carried out 
in under OOFS (Object oriented programming system), the extension to the list of 
litness criterion may be easier. 


5. I'Jinpirirnl icHthuj of GA paramcievs 

In the te.st re.snlt.s pre.sented here, wo have kept a fixed value for each OA parameter. 
However, it may be a good idea to vary these parameters and investigate their effect 
on this particular application. It may be found that a particular set of input data may 
be best handled by a particular set of GA parameters. For example, problems involving 
large vaxiation in M A'l' parameters tend to become more complex, and may require larger 
population sixe.s. A liigh uintatiou probability may be suitable in the applications where 
user is more interested in diversity of the shapes rather than it’s optimization. A research 
on tliese i.ssnes may provide some empirical guidelines to user about what GA parameters 
to use for a pa.rtuuilar ajiplication. 



Chapter 6 

Summary and conclusions 


111 tliLs ilie.sifi, 


a r('al-r()(l(’(l croKKovor operator has been developed based on the search char- 


acteristics of a si iiftle point crossover used in hi 


nary-coded GAs. 


Ill order to define the search 


power of a crossovi'r opc'rator, a spread factor has been defined as the ratio of the absolute 
(lin'ereiices of the children points to that of the parent points. Thereafter, the probability of cre- 
ating a cliild point for two given (larent points lias been derived for the single-point crossover. 
Motivated by the success of binary-coded GAs in problems with discrete search space, simu- 
lated binary crossovi'r (HHX) operator luvs been developed to solve problems having continuous 
search siiac.e. 'I’he SilX operator has similar search power as that of the single-point crossover. 

On a miinber of ti'st functions including I)e Jong’s five test functions, it has been found 
that real ’C.odi'd (lAs witli tlii' HliX operator perform as good or better than binary-coded GAs 
with the single point crossover. In comparison with another real-coded crossover operator 
(IILX-O.f)) suggested ('Isewliere, SBX performs better in difficult test functions. It has been 
observed that HIIX is particularly useful in problems where the bounds of the optimum point 
is not known a priori and wlieri' ilime are imiltiplo optima, of wJiicli one is global. 

Ileal-coded GAs with the MIIX operator have been also used to solve a, blocked rmictioii, 
iillrodnc.ed by Goldberg (lf)|)l). Hlucked functions are di 111 cult for real-coded GAs, because 
local optimal points block the jirogress of search to continue towards the global optimal point. 
An active, two-sided blocked function has been designed and solved using GAs with the SBX 
operator. The simiilatioii results have shown that in most occasions, the search proceeds the 
way as predicted, but HliX ean overcome the blocking by the local optima,! points and converge 
to tlie global optimal solution. 

These results are encouraging and suggest avenues for further roseaich. Because tlic SBX 
operator uses a iiroliability distribution for cliooshig a child point, the real-coded GAs with 
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SI)X ill'' f’"'' *^*'*'1' ‘'1"''”^ ^'Ah ill Uiriiis of adiioving a convcrgcnco proof 

for (JAh- With iv dirocl, prohahilislic relationship between children and parent points used in 
this thesis, cik's from tlu* classical stochastic optimization methods can be borrowed to achieve 
a convorg<'nc('. proof of (lAs, or a imich closer tie between the classical optimization methods 
and genetic algorithms is on the, horizon. 

In the secoml part of the tlu'sis, a,ttention has been focused on a real-world application for 
GA with SBX. Here, a less-precise model (a hybrid qualitative model) is used to represent a 
large class of possible shapes. tlA with HBX has been used to find out desirable shapes from 
the. class of shapes reirresented by QMAT, and to remove infeasible or undesirable ones from 
consideration. 'The system is designed in such a way that a user can Identify his application 
oriented desirability critmia a.nd set the weight to each criterion accordingly. Several sample 
shaiK's (leveloiu'd by this system have h(>en shown, which reveal the usefulness of this system in 
conceptual (h'sign stage. Although this is intended to be an application test for GA, the problem 
is of HuHicient importance of its own in the, areas of shape approximation and conceptual design. 
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APPENDIX-I 


A i.rogram ‘iga,.c,’ was wrUioa in ‘C’ programming language to test onr SBX crossover 
approacli in this thesis. I'his is a GA implementation and have following features : 


0 Variable representation : Mixed (binary + real-coded). Can be used as purely binary 
GA or purely real-coded GA depending on user’s wish. 


0 Selection scheme : tournament selection (size of tournament can be fixed by user) 

0 Crossover : hor binary strings — - single point crossover 
For real numbers — SIIX / BLX-0.5 


0 Crossover approach for multi-variable problems : All three approaches of section 2.4.4 

0 Mutation : For binary strings — bitwise mutation 
For real numbes real-coded mutation of .section 2.5 


In addition to thi.s, the user can .sot following toggle switches : 

0 Variable bounds : fixed / (iexible 

0 Evaluation : with sluvring / without sharing 

0 lloport : end ro|)ort / generation-wise report 

One particular CA run is terminated if any one of the following criteria is satisfied ; 

1. Premature convergence. 

2. I'arget fitness value achieved. 

Global solution is acitievecl. 

d, Mitxiimim numix'r of generations over. 

5. Population has spread in a very wide range (specified by user). 

Since this program wns intended to be used for testing purjjose, it has the option of 
running the GA program many times (user can specify the number of runs) with dillerent 
seed numbers generated automatically in the program. A final report is prepared in the file 
‘realga.out’ for inspection. Following is a sample output file generated by the program. 


INITIAL REPORT 

iasec!H:3ns:aS!aat3niaaaaBSB!a!=S!BoBa» = at;t3n!=ann:=:=:=i====‘ 

CROSSOVER TYPE ! Binary GA (Singla-pt); Real GA (SBX) 
STRATEGY : Uniformly all variables 60 
TYPE OF BOUNDARIES : Rigid 


Population size : 60 

Total no, of generations : 40 

Cross over probability : 1,0000 

Mutation probability : 0.0100 



Sharing not to be done : 

String length : 15 

Number of veiriables 

Continuous ; 2 
Discrete : 1 
TOTAL : 3 
Total Runs to be performed : 4 
Epsilon for convergence : 0.001 

Epsilon for closeness ; 0.001 

Distribution index : 2 

Tournament size : 2 

Tcirget Objective fn. value : 0 

Lower and Upper bounds : 


-1.0000 

<= 

xl 

<= 

1.0000 

-1.0000 

<= 

x2 

<= 

1.0000 

-1.0000 

<a 

x3 

<= 

1.0000 


Run No. 

Best X 

Trials 

Spread 

Comment/Best String 

1. 

0.0008850 

1300 

-0.935789 to 

0.563219 


-0.0009773 


-0.009467 to 

0.172813 


-0.0002669 


-0.010862 to 

0.006025 100000000001110 


OBJECTIVE = 0.0000018 



Random Seed 

Number 

: 0.4000 

-> Target foimd 

2. 

-0.0008850 

1750 

-0.250771 to 

-0.000031 


0.0004291 


-0.006790 to 

0.006809 


0.0009418 


-0.004047 to 

0.008719 011111111110001 


OBJECTIVE = 0.( 

D000019 



Random Seed 

Number 

: 0.5200 

-> Target found 

3, 

0.0009461 

1800 

-0.980895 to 

0.750420 


0.0003447 


-0.006470 to 

0.003524 


-0.0006636 


-0.005220 to 

0.001924 100000000001111 


OBJECTIVE = 0.' 

0000015 



Random Seed 

Number 

: 0.6400 

-> Target found 

4. 

-0.0001526 

2050 

-0.516732 to 

0.999817 


-0.0006892 


-0.001662 to 

0.002052 


-0.0010936 


0.001370 to 

0.002423 011111111111101 


OBJECTIVE = 0 . 

0000017 



Random Seed 

Number 

: 0.7600 

-> Not Found 




Average trials given = 1616.7 

Population prematurely converged 0 times 
Total Successes,® 3 out of 4 
Average of best objective ® 0.000002 




me- 






